tagsoup - A SAX-compliant HTML parser written in Java
Website: | http://home.ccil.org/~cowan/XML/tagsoup/ |
---|---|
License: | GPLv2+ or AFL |
Vendor: | Fedora Project |
- Description:
TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML.
Packages
tagsoup-1.0.1-2.2.fc10.i386 [143 KiB] |
Changelog
by Tom "spot" Callaway (2008-07-10):
- drop repotag - fix license tag |