Sponsorised links
This month
Python, Unicode and UnicodeDecodeError
October 2009
Sponsorised links
September 2009
August 2009
Character encoding detection for external scripts
This is (EF BB BF) C3 B6 3D 22 21 22 loaded into browsers under various labels. That happens to be properly formed ECMAScript code for all the encodings used. The bogus results for Opera9 can easily be reproduced in context of the testing script, but probably not individually from a clean cache; what's going on there is unknown. I also noted in running these tests that Opera claims "Opera supports the entire ECMA-262 2nd and 3rd standards with no exceptions" while in fact their implementation does not, the parser rejects code that follows the IdentifierStart :: UnicodeEscapeSequence production of ECMA-262 section 7.6. Instead it implements Opera-only extensions, like comma-free arrays ala [ 1 2 3 ]. Other fun facts include: IE does not implement onload for iframes and cannot modify the innerHTML or tr elements; Firefox ignores "tags" when setting the innerHTML of dynamically created tr elements with no ownerElement... Oh and Opera again needs /th "tags" so it won't nest adjacent th elements when setting innerHTML.
Flickr - Sauvegarde/export globale des données(métadonnées) - Wiki URFIST
BabelStone : Software : BabelPad (Unicode Text Editor for Windows)
July 2009
June 2009
Ascii Table - ASCII character codes and html, octal, hex and decimal charts
May 2009
March 2009
Understanding Bidirectional (BIDI) Text in Unicode
January 2009
UTF-8 Browser Test for Unicode Block 'Miscellaneous Symbols'
URDU Unicode Utility - About
December 2008
The Voidspace Techie Blog
If we are only ever using this file on the current system then maybe we don't need to worry, but if we ever need to read data that might have been created on another system then we had better know what encoding was used. The solution: the open function takes an optional encoding parameter:
Blog Stéphane Bortzmeyer: Mon exposé à Sparkling Point sur les conséquences politiques des choix techniques
un exposé sur « Les conséquences politiques des choix techniques », suivant l'observation de Lawrence Lessig que « The code is the law » (l'architecture - d'un système technique - est la loi).
Je développe trois exemples, empruntés au monde de l'Internet, les politiques d'allocations d'adresses IP, la révision de la norme sur les noms de domaines en Unicode et les débats sur la nouvelle architecture de routage et d'adressage de l'Internet.
November 2008
