2011
2010
2009
Character encoding detection for external scripts
This is the byte sequence (EF BB BF) C3 B6 3D 22 21 22 loaded into browsers as an external script under various encoding labels. It happens to be well-formed ECMAScript code in all the encodings used. The bogus results for Opera 9 can easily be reproduced in the context of the testing script, but probably not individually from a clean cache; what is going on there is unknown. While running these tests I also noted that Opera claims "Opera supports the entire ECMA-262 2nd and 3rd standards with no exceptions", while in fact its implementation does not: the parser rejects code that follows the IdentifierStart :: \ UnicodeEscapeSequence production of ECMA-262 section 7.6. Instead it implements Opera-only extensions, like comma-free arrays à la [ 1 2 3 ]. Other fun facts: IE does not implement onload for iframes and cannot modify the innerHTML of tr elements; Firefox ignores "tags" when setting the innerHTML of dynamically created tr elements with no ownerElement... Oh, and Opera again needs /th "tags" so that it won't nest adjacent th elements when setting innerHTML.
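To see why one byte sequence can act as a different script under each label, here is a minimal Python sketch that decodes the payload under a few labels. The label list and the helper names `decode_as` and `variable_name` are assumptions made for illustration; the labels actually used in the test are not listed here. Presumably the test can tell which encoding the browser applied by checking which variable ended up assigned.

```python
# Payload from the entry above; the BOM is optional, as its parentheses suggest.
BOM = bytes.fromhex("EF BB BF")
PAYLOAD = bytes.fromhex("C3 B6 3D 22 21 22")

# Hypothetical label set, chosen only for illustration.
LABELS = ["utf-8", "iso-8859-5"]


def decode_as(data: bytes, label: str) -> str:
    """Decode data under the given label; undecodable bytes become U+FFFD."""
    return data.decode(label, errors="replace")


def variable_name(source: str) -> str:
    """Return the text left of the first '=', i.e. the assignment target."""
    return source.split("=", 1)[0]


for label in LABELS:
    text = decode_as(PAYLOAD, label)
    print(f"{label:12} source={text!r} variable={variable_name(text)!r}")

# With the BOM prepended, Python's "utf-8-sig" codec strips it before decoding.
print(repr(decode_as(BOM + PAYLOAD, "utf-8-sig")))
```

Under utf-8 the two bytes C3 B6 form the single letter ö, while a single-byte encoding such as iso-8859-5 turns them into two letters, so the same file assigns the string "!" to a differently named variable depending on how it was decoded.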
2008
FAQ xHTML/CSS - Charset ISO-8859-1, ISO-8859-15, UTF-8: which one to choose?
Storing UTF-8 text correctly in a MySQL database - Le blog de jp.fox, an IT guy in the countryside...
2007
Setting charset in htaccess
2006
ongoing · Characters vs. Bytes
Character Conversions from Browser to Database
charset-detector: a small program that automatically detects document encoding
Essential Character Encoding Resources
ASCII Character Set
2005