Sponsorised links
October 2009
Whatpm::HTML - An HTML Parser and Serializer
Whatpm::HTML - An HTML Parser and Serializer
Sponsorised links
September 2009
August 2009
Character encoding detection for external scripts
This is (EF BB BF) C3 B6 3D 22 21 22 loaded into browsers under various labels. That happens to be properly formed ECMAScript code for all the encodings used. The bogus results for Opera9 can easily be reproduced in context of the testing script, but probably not individually from a clean cache; what's going on there is unknown. I also noted in running these tests that Opera claims "Opera supports the entire ECMA-262 2nd and 3rd standards with no exceptions" while in fact their implementation does not, the parser rejects code that follows the IdentifierStart :: UnicodeEscapeSequence production of ECMA-262 section 7.6. Instead it implements Opera-only extensions, like comma-free arrays ala [ 1 2 3 ]. Other fun facts include: IE does not implement onload for iframes and cannot modify the innerHTML or tr elements; Firefox ignores "tags" when setting the innerHTML of dynamically created tr elements with no ownerElement... Oh and Opera again needs /th "tags" so it won't nest adjacent th elements when setting innerHTML.
RDFa Fragment Parser
Paste a chunk of XHTML RDFa below, and click "Parse."
make sure you do the right thing for RDFa validation when you eventually place this chunk inside a web page
Python Package Index : finaloption 0.9.6
command line parsing done right
Finaloption is a command line parser, intended to make writing command line applications easy and painless. It uses built-in Python types (lists, dictionaries, etc) to define options, which makes configuration clear and concise. Additionally it contains possibility to handle subcommands (i.e. hg commit or svn update).
July 2009
fyzz (fyzz is a sparkling Python parser for the Sparql query language) (Logilab.org)
fyzz is a sparkling Python parser for the Sparql query language
John Resig - HTML 5 Parsing
If you're interested in giving the new parser a try (it's doubtful that you'll see many obvious changes - but any help in hunting down bugs would be appreciated) you can download a nightly of Firefox, open about:config, and set html5.enable to true.
May 2009
Python Package Index : pyWxSVG 0.1
View and print svg file or svg content, convert svg to raster graphics. Partial support svg format. Tested with Python 2.5 and wxPython 2.8.9.2. Drawing use wx.GraphicsContext class. Path parser from Enable - SVGPathParser class.
April 2009
March 2009
Beautiful Soup: We called him Tortoise because he taught us.
RFC (2)822 & 3696 Email Address Parser in PHP
The test suite shows results for each parser, based on these test definitions. These are borrowed from Dominic Sayers who has a similar parser. We are still arguing over certain tests ;)
February 2009
Les parsers HTML5 - La Tortue Cynique / The Cynical Turtle
Bref, on a donc besoin d'un parser spécifique (après 30 ans à travailler avec des parsers génériques GML et SGML),
January 2009
Urfist Info: Parser biblio
December 2008
Bug 6298 – Provide a parser override
The spec uses "XHTML5" and so does Validator.nu. The term is pretty widely used on the Web.
Occurences du mot XHTML5 dans Google en prenant soin d'enlever les protagonistes du WhatWG, les aggrégateurs de contenu et les archivages des listes de discussions ? 3000. La bonne parole du WhatWG, ça ferait presque regretter les témoins de Jéhovah.
November 2008
Source de la classe PHP XMLClassDoc
PHP Simple HTML DOM Parser
- A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way!
- Require PHP 5+.
- Supports invalid HTML.
- Find tags on an HTML page with selectors just like jQuery.
- Extract contents from HTML in a single line.
PHP Simple HTML DOM Parser
- A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way!
- Require PHP 5+.
- Supports invalid HTML.
- Find tags on an HTML page with selectors just like jQuery.
- Extract contents from HTML in a single line.
