This site will look much better in a browser that supports web standards, but it is accessible to any browser or Internet device.

The Open Source Zone


CyberNeko HTML Parser

Project Name CyberNeko HTML Parser
Category Libraries
Programming Language Java
Project Home Page http://people.apache.org/~andyc/neko/doc/html/index.html
License(s) Apache style
License URL(s) http://people.apache.org/~andyc/neko/LICENSE
Tags (edit) html, parser, xml

NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces. The parser can scan HTML files and "fix up" many common mistakes that human (and computer) authors make in writing HTML documents. NekoHTML adds missing parent elements; automatically closes elements with optional end tags; and can handle mismatched inline element tags.

NekoHTML is written using the Xerces Native Interface (XNI) that is the foundation of the Xerces2 implementation. This enables you to use the NekoHTML parser with existing XNI tools without modification or rewriting code.

Edit project.

Report abuse.

Similar Projects (edit)

Reviews

Be the first person to review this project!

User tips

Add a tip.

Related books

Add more books.


Last modified on 2006-08-19 11:13:02.357 (rel. 3).