PUBLIC MARKS from dcancel with tags screenscraping & webcrawler

22 August 2006 22:15

Ariel

A library for extracting information from semi-structured documents such as websites. Ariel uses a small number of labeled examples to generate and learn effective extraction rules.
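To make "learning extraction rules from labeled examples" concrete, here is a minimal Python sketch of the general idea. It is not Ariel's API or algorithm: the function names (`learn_rule`, `extract`) and the sample documents are hypothetical, and the technique shown is a much simpler stand-in that derives a left and right text landmark from whatever surrounds the labeled value in every example.

```python
import os


def common_prefix(strings):
    # Longest prefix shared by all strings (character-wise comparison).
    return os.path.commonprefix(strings)


def common_suffix(strings):
    # Longest shared suffix, computed as the common prefix of the reversed strings.
    return os.path.commonprefix([s[::-1] for s in strings])[::-1]


def learn_rule(examples):
    """Learn (left_landmark, right_landmark) from (document, labeled_value) pairs.

    Hypothetical helper: assumes each labeled value occurs in its document.
    """
    befores, afters = [], []
    for doc, value in examples:
        start = doc.index(value)
        befores.append(doc[:start])
        afters.append(doc[start + len(value):])
    return common_suffix(befores), common_prefix(afters)


def extract(rule, doc):
    """Apply a learned rule: return the text between the two landmarks."""
    left, right = rule
    start = doc.index(left) + len(left)
    end = doc.index(right, start)
    return doc[start:end]


# Two labeled examples (made-up documents for illustration).
examples = [
    ("<p>Books</p><b>Title:</b> Dune<br>paperback", "Dune"),
    ("<p>More</p><b>Title:</b> Emma<br>hardcover", "Emma"),
]
rule = learn_rule(examples)

# Apply the learned rule to an unseen document with the same layout.
result = extract(rule, "<p>Shelf</p><b>Title:</b> Ulysses<br>ebook")
print(rule)
print(result)
```

With only two examples the learned landmarks can overfit to incidental shared text, which is why more varied examples generally give a more robust rule.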

dcancel's TAGS related to tag screenscraping

javascript, rss, ruby, rueble, web, webcrawler, xml