| Project Name | HTMLParser |
| Category | Libraries |
| Programming Language | Java |
| Project Home Page | http://htmlparser.sourceforge.net/ |
| License(s) | LGPL |
| License URL(s) | http://www.gnu.org/copyleft/lgpl.html |
| Tags | html, parser |
HTMLParser is a fast, real-time parser for real-world HTML. Developers have been drawn to it mainly for its simple design, its speed, and its ability to handle streaming real-world HTML.
The parser handles two fundamental use cases: extraction and transformation. (The synthesis use case, where HTML pages are created from scratch, is better handled by other tools closer to the source of the data.) While earlier versions concentrated on extracting data from web pages, version 1.4 of HTMLParser substantially improves web page transformation, with simplified tag creation and editing and verbatim toHtml() method output.
Last modified on 2006-02-27 23:00:40.628 (rel. 5).
