PUBLIC marks

PUBLIC MARKS with tag parser

November 2008

PHP Simple HTML DOM Parser

by srcmax & 8 others , 3 comments
  • An HTML DOM parser written in PHP 5+ that lets you manipulate HTML very easily.
  • Requires PHP 5+.
  • Supports invalid HTML.
  • Find tags on an HTML page with selectors just like jQuery.
  • Extract contents from HTML in a single line.
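The idea of finding tags in a page that may not be valid HTML can be illustrated with a small sketch. This is not the PHP library's API: it is a minimal Python illustration using the standard-library `html.parser` module, and the names `TagFinder` and `find_tags` are hypothetical. It matches on tag name only, which is far less than a jQuery-style selector engine offers.

```python
from html.parser import HTMLParser


class TagFinder(HTMLParser):
    """Collects the attributes of every start tag with a given name.

    Hypothetical helper for illustration; it matches by tag name only.
    """

    def __init__(self, wanted):
        super().__init__()
        self.wanted = wanted
        self.matches = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag names and reports attributes as pairs.
        if tag == self.wanted:
            self.matches.append(dict(attrs))


def find_tags(html, tag):
    """Return a list of attribute dicts, one per matching start tag."""
    finder = TagFinder(tag)
    finder.feed(html)
    finder.close()
    return finder.matches


# Deliberately invalid markup: unclosed <a> and <p>, unquoted attribute.
broken = '<div><a href="/one">one<p><a href="/two" class=x>two</div>'
links = find_tags(broken, "a")
print([a.get("href") for a in links])
```

Because the parser is event-based and never requires a balanced tree, the unclosed elements in `broken` do not stop both links from being reported.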

PHP Simple HTML DOM Parser

by Spone & 8 others , 3 comments
  • An HTML DOM parser written in PHP 5 that lets you manipulate HTML very easily.
  • Requires PHP 5.
  • Supports invalid HTML.
  • Find tags on an HTML page with selectors just like jQuery.
  • Extract contents from HTML in a single line.

October 2008

August 2008

HTML5 parser integrated in W3C Markup Validator from Olivier Thereaux on 2008-08-25 (www-validator@w3.org from August 2008)

by Xavier Lacot
As part of an effort to promote the current work on HTML to web developers, Olivier Thereaux has been working lately on integrating the W3C Markup Validator with the HTML5 parser and checker developed over the past few years by Henri Sivonen.

CSSPool

by karlcow

CSSPool (pronounced "cesspool") is a CSS parser. It is a SAC parser (similar to SAX), and it validates.

Aeracode :: Graphs, Python and CSS

by karlcow

I could only find one python CSS library, cssutils, and while that seemed to have very decent CSS2 support for parsing into a document tree, I couldn't see any immediate way of using it for retrieving the applicable values for, say, a grid object with class "minor" inside a wavegraph object.

firefox mozilla/parser/htmlparser/src/nsHTMLTokenizer.cpp

by karlcow
The parsing rules for Mozilla are here. Thanks, Damien. The comments in the source code had us in fits of laughter.

The Performance Cost of the HTML Tree Builder

by karlcow , 4 comments

Xerces is faster. Namespaces are worse than the much-maligned HTML “extra fix-ups” (21% hit vs. 7% hit). An XML parser can be slow.

June 2008

May 2008

Parser Generators

by YukuanBlog
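During software development, we may well have to write large amounts of code to get tedious, mundane work done; the way out of this rut is automation. As Kernighan and Pike explain in The Practice of Programming, good software design applies a few basic principles: simplicity, clarity, generality, and automation.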

John Resig - Pure JavaScript HTML Parser

by karlcow & 3 others , 2 comments

(I also contemplated porting the HTML 5 parser, wholesale, but that seemed like a herculean effort.)

Is HTML5 parsing difficult to implement?

April 2008

Ian Bicking: a blog :: Python HTML Parser Performance

by karlcow

A performance comparison of several parsers and document models. The situation is a little complex because there are different steps in handling HTML: (1) parse the HTML, (2) parse it into something (a document object), (3) serialize it.
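The steps named above can be sketched with the Python standard library alone. This is not the benchmark from the linked post: it is a minimal illustration in which `html.parser` tokenizes the markup, a hypothetical `TreeBuilder` class turns the events into an `xml.etree.ElementTree` document, and `ET.tostring` serializes it. Steps 1 and 2 are timed together here because the tree is built while the parser runs; the helper names `parse` and `timed` are assumptions made for this example.

```python
import time
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Elements that never have an end tag (a partial list, for the sketch only).
VOID = {"br", "hr", "img", "input", "link", "meta"}


class TreeBuilder(HTMLParser):
    """Builds an ElementTree document from html.parser events."""

    def __init__(self):
        super().__init__()
        self.root = ET.Element("document")
        self.stack = [self.root]  # currently open elements

    def handle_starttag(self, tag, attrs):
        attributes = {name: value or "" for name, value in attrs}
        node = ET.SubElement(self.stack[-1], tag, attributes)
        if tag not in VOID:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Close back to the nearest matching open element; ignore strays.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        parent = self.stack[-1]
        if len(parent):
            # Text after a child element belongs to that child's tail.
            last = parent[-1]
            last.tail = (last.tail or "") + data
        else:
            parent.text = (parent.text or "") + data


def parse(html):
    """Steps 1 and 2: tokenize the HTML and build a document object."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def timed(label, fn):
    """Run fn, print how long it took, and return its result."""
    start = time.perf_counter()
    result = fn()
    print(f"{label}: {time.perf_counter() - start:.6f}s")
    return result


sample = "<p>Hello <b>world</b>!</p><br><p>Bye</p>" * 200
tree = timed("parse + build", lambda: parse(sample))
output = timed("serialize", lambda: ET.tostring(tree, encoding="unicode"))
```

Timing the stages separately matters because a library that parses quickly may still build or serialize its document slowly, so a single total would hide where the cost lies.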

March 2008

Messages in a bottle » Blog Archive » Grune and Jacobs, Parsing Techniques, Second Edition

by karlcow

The second edition of Parsing techniques: A practical guide, by Dick Grune and Ceriel J. H. Jacobs, has now appeared.

February 2008

PHP Simple HTML DOM Parser

by Xavier Lacot & 8 others , 3 comments
A simple PHP HTML DOM parser written in PHP 5+. I wrote it because using the PHP DOM XML library to parse HTML is really a pain. The original idea is from Jose Solorzano's HTML Parser for PHP 4.

PHP Simple HTML DOM Parser

by nicolas tehu & 8 others , 3 comments
A simple PHP HTML DOM parser written in PHP 5. I wrote it because using the PHP DOM XML library to parse HTML is really a pain. The original idea is from Jose Solorzano's HTML Parser for PHP 4.

January 2008

librdfa - a pure C RDFa parser from Manu Sporny on 2008-01-31 (public-rdf-in-xhtml-tf@w3.org from January 2008)

by karlcow

librdfa is a pure C implementation of a standards-compliant RDFa parser. The library is quite easy to use (there are only 5 functions). librdfa is stream-based, very small and quite fast.
