
The Open Source Zone


HTMLParser

Project Name: HTMLParser
Category: Libraries
Programming Language: Java
Project Home Page: http://htmlparser.sourceforge.net/
License(s): LGPL
License URL(s): http://www.gnu.org/copyleft/lgpl.html
Tags: html, parser

HTMLParser is a fast, real-time parser for real-world HTML. Most developers have been drawn to it by its simple design, its speed, and its ability to handle streaming real-world HTML.

The parser handles two fundamental use-cases: extraction and transformation. (The synthesis use-case, where HTML pages are created from scratch, is better handled by other tools closer to the source of the data.) While prior versions concentrated on extracting data from web pages, version 1.4 of HTMLParser substantially improves web page transformation, with simplified tag creation and editing, and verbatim toHtml() method output.
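
To make the extraction use-case concrete, here is a minimal sketch that collects the link targets from a fragment of untidy HTML. It deliberately uses only the JDK's built-in Swing HTML parser as a stand-in, so it runs without extra libraries; it is not HTMLParser's own API, and the class name LinkExtractionSketch and the method extractLinks are hypothetical names chosen for illustration.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical illustration class; uses the JDK parser, not HTMLParser itself.
class LinkExtractionSketch {

    // Returns the href value of every <a> tag, in document order.
    static List<String> extractLinks(String html) throws IOException {
        List<String> links = new ArrayList<>();
        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                if (tag == HTML.Tag.A) {
                    Object href = attrs.getAttribute(HTML.Attribute.HREF);
                    if (href != null) {
                        links.add(href.toString());
                    }
                }
            }
        };
        // The callback is invoked as the input is read, so the whole page
        // does not need to be turned into a tree before links are collected.
        new ParserDelegator().parse(new StringReader(html), callback, true);
        return links;
    }

    public static void main(String[] args) throws IOException {
        // Real-world HTML is often sloppy: the <p> here is never closed.
        String html = "<html><body><p>Unclosed paragraph "
                + "<a href=\"a.html\">first</a>"
                + "<a href=\"http://example.org/\">second</a></body></html>";
        for (String link : extractLinks(html)) {
            System.out.println(link);
        }
    }
}
```

The transformation use-case follows the same pattern, except that matched tags are edited and the page is written back out rather than only read.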


Last modified on 2006-02-27 23:00:40.628 (rel. 5).