Class Summary | |
---|---|
AbstractNode | The concrete base class for all types of nodes (tags, text, remarks). |
AndFilter | Accepts nodes matching all of its predicate filters (AND operation). |
AndFilterWrapper | Wrapper for AndFilters. |
AppletTag | AppletTag represents an <Applet> tag. |
Attribute | An attribute within a tag. |
Attributes | Provides access to the tag attributes. |
BaseHrefTag | BaseHrefTag represents a <Base> tag. |
BeanyBaby | Demo of beans. |
BodyTag | A Body Tag. |
Bullet | A bullet tag. |
BulletList | A bullet list tag. |
ChainedException | Support for chained exceptions in code that predates Java 1.4. |
CharacterReference | Structure to hold a character and its equivalent entity reference kernel. |
CompositeTag | The base class for tags that have an end tag. |
CompositeTagScanner | The main scanning logic for nested tags. |
ConnectionManager | Handles proxies, password protected URLs and request properties including cookies. |
ConnectionMonitor | Interface for HTTP connection notification callbacks. |
Cookie | An HTTP cookie. |
CssSelectorNodeFilter | A NodeFilter that accepts nodes based on whether they match a CSS2 selector. |
Cursor | A bookmark in a page. |
DefaultParserFeedback | Default implementation of the HTMLParserFeedback interface. |
DefinitionList | A definition list tag (dl). |
DefinitionListBullet | A definition list bullet tag (either DD or DT). |
Div | A div tag. |
DoctypeTag | Identifies the HTML document type declaration (<!DOCTYPE>) tag. |
EncodingChangeException | The encoding is changed invalidating already scanned characters. |
Feedback | Mediates between the feedback mechanism of the htmlparser and an error handler. |
FeedbackManager | Implementation of static methods that allow the parser to route various messages to any implementation of the HTMLParserFeedback interface. |
Filter | Base class for all filters. |
FilterBean | Extract nodes from a URL using a filter. |
FilterBuilder | The main program for the FilterBuilder programming system. |
FormTag | Represents a FORM tag. |
FrameSetTag | Identifies a frame set tag. |
FrameTag | Identifies a frame tag. |
HasAttributeFilter | This class accepts all tags that have a certain attribute, and optionally, with a certain value. |
HasAttributeFilterWrapper | Wrapper for HasAttributeFilters. |
HasChildFilter | This class accepts all tags that have a child acceptable to the filter. |
HasChildFilterWrapper | Wrapper for HasChildFilters. |
HasParentFilter | This class accepts all tags that have a parent acceptable to another filter. |
HasParentFilterWrapper | Wrapper for HasParentFilters. |
HasSiblingFilter | This class accepts all tags that have a sibling acceptable to another filter. |
HasSiblingFilterWrapper | Wrapper for HasSiblingFilters. |
HeadingTag | A heading (h1 - h6) tag. |
HeadTag | A head tag. |
Html | An html tag. |
HtmlPage | |
HtmlTreeCellRenderer | Renderer for tree view of a NodeList. |
HtmlTreeModel | Quick and dirty tree model for HTML nodes. |
HttpHeader | Utility methods to display HTTP headers. |
HTMLLinkBean | Display the links from a URL. |
HTMLTextBean | Display the textual URL contents. |
ImageTag | Identifies an image tag. |
InputStreamSource | A source of characters based on an InputStream such as from a URLConnection. |
InputTag | An input tag in a form. |
IsEqualFilter | This class accepts only one specific node. |
IteratorImpl | |
JspScanner | Placeholder for yet to be written scanner for JSP tags. |
JspTag | The JSP/ASP tags like <%...%> can be identified by this class. |
LabelTag | A label tag. |
Lexer | This class parses the HTML stream into nodes. |
LinkBean | Extract links from a URL. |
LinkExtractor | LinkExtractor extracts all the links from the given webpage and prints them on standard output. |
LinkFindingVisitor | |
LinkRegexFilter | This class accepts tags of class LinkTag that contain a link matching a given regex pattern. |
LinkStringFilter | This class accepts tags of class LinkTag that contain a link matching a given pattern string. |
LinkTag | Identifies a link tag. |
Locator | Transforms character offsets into line and column in the HTML file. |
MetaTag | A meta tag. |
Node | Specifies the minimum requirements for nodes returned by the Lexer or Parser. |
NodeClassFilter | This class accepts all tags of a given class. |
NodeClassFilterWrapper | Wrapper for NodeClassFilters. |
NodeFactory | This interface defines the methods needed to create new nodes. |
NodeFilter | Implement this interface to select particular nodes. |
NodeIterator | |
NodeList | |
NodeTreeWalker | A class for walking a tree of Node objects, in either a depth-first or breadth-first manner. |
NodeVisitor | The base class for the 'Visitor' pattern. |
NotFilter | Accepts all nodes not acceptable to its predicate filter. |
NotFilterWrapper | Wrapper for NotFilters. |
NullLayoutManager | A layout manager that doesn't move things around. |
ObjectFindingVisitor | |
ObjectTag | ObjectTag represents an <Object> tag. |
OptionTag | An option tag within a form. |
Ordered | Describes an object that knows about ordering. |
OrFilter | Accepts nodes matching any of its predicate filters (OR operation). |
OrFilterWrapper | Wrapper for OrFilters. |
Page | Represents the contents of an HTML page. |
PageAttribute | An attribute within a tag on a page. |
PageIndex | A sorted array of integers, the positions of the first characters of each line. |
ParagraphTag | A paragraph (p) tag. |
Parser | The main parser class. |
ParserException | Library-specific support for chained exceptions. |
ParserFeedback | Interface for providing feedback without forcing the output destination to be predefined. |
ParserUtils | |
Picture | Class to track pictures within the frame. |
PicturePanel | Hold and display a group of pictures. |
ProcessingInstructionTag | The XML processing instructions like <? |
PrototypicalNodeFactory | A node factory based on the prototype pattern. |
RegexFilter | This filter accepts all string nodes matching a regular expression. |
RegexFilterWrapper | Wrapper for RegexFilters. |
Remark | This interface represents a comment in the HTML document. |
RemarkNode | The remark tag is identified and represented by this class. |
Scanner | Generic interface for scanning. |
ScriptDecoder | Decode script. |
ScriptScanner | The ScriptScanner handles script CDATA. |
ScriptTag | A script tag. |
SelectTag | A select tag within a form. |
Sequencer | Display received images at a constant rate. |
SimpleNodeIterator | The HTMLSimpleEnumeration interface is similar to NodeIterator, except that it does not throw exceptions. |
SiteCapturer | Save a web site locally. |
Sort | A quick sort algorithm to sort Vectors or arrays. |
Sortable | Provides a mechanism to abstract the sort process. |
Source | A buffered source of characters. |
Span | A span tag. |
Stream | Provides for asynchronous fetching from a stream. |
StringBean | Extract strings from a URL. |
StringExtractor | Extract plaintext strings from a web page. |
StringFilter | This class accepts all string nodes containing the given string. |
StringFilterWrapper | Wrapper for StringFilters. |
StringFindingVisitor | |
StringSource | A source of characters based on a String. |
StyleScanner | The StyleScanner handles style elements. |
StyleTag | A StyleTag represents a <style> tag. |
SubFilterList | A helper class for lists of filters within filters. |
Tabby | Replace tabs with spaces. |
TableColumn | A table column tag. |
TableHeader | A table header tag. |
TableRow | A table row tag. |
TableTag | A table tag. |
Tag | This interface represents a tag (<xxx yyy="zzz">) in the HTML document. |
TagFindingVisitor | |
TagNameFilter | This class accepts all tags matching the tag name. |
TagNameFilterWrapper | Wrapper for TagNameFilters. |
TagNode | TagNode represents a generic tag. |
TagScanner | TagScanner is an abstract superclass, subclassed to create specific scanners. |
Text | This interface represents a piece of the content of the HTML document. |
TextareaTag | A text area tag within a form. |
TextExtractingVisitor | Extracts text from a web page. |
TextNode | Normal text in the HTML document is represented by this class. |
Thumbelina | View images behind thumbnails. |
ThumbelinaFrame | Encapsulate a Thumbelina bean and add menu and preferences support. |
TileSet | Class to track picture regions. |
TitleTag | A title tag. |
Translate | Translate numeric character references and character entity references to unicode characters. |
UrlModifyingVisitor | |
VerticalLayoutManager | A layout manager like a vertical FlowLayout. |
WikiCapturer | Save a wikiwikiweb locally. |
XMLReader | SAX parser. |
XorFilter | Accepts nodes matching an odd number of its predicate filters (XOR operation). |
HTML Parser is an open source library released under LGPL. | |
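
The AndFilter, OrFilter, XorFilter and NotFilter rows above describe boolean combinations of predicate filters. The sketch below illustrates only that logic, using plain `java.util.function.Predicate` over strings as a stand-in. It is not the library's code: the class name `FilterLogicSketch`, its method names, and the string-based "nodes" are all assumptions made for illustration, and behavior for an empty predicate list is a choice of this sketch rather than something the summary specifies.

```java
import java.util.function.Predicate;

// Hypothetical stand-in for the filter semantics listed in the summary table.
// None of these names come from the HTML Parser library itself.
public class FilterLogicSketch {

    /** AND: accepts a node only when every predicate accepts it. */
    @SafeVarargs
    static <T> Predicate<T> and(Predicate<T>... predicates) {
        return node -> {
            for (Predicate<T> p : predicates) {
                if (!p.test(node)) {
                    return false;
                }
            }
            return true;
        };
    }

    /** OR: accepts a node when at least one predicate accepts it. */
    @SafeVarargs
    static <T> Predicate<T> or(Predicate<T>... predicates) {
        return node -> {
            for (Predicate<T> p : predicates) {
                if (p.test(node)) {
                    return true;
                }
            }
            return false;
        };
    }

    /** XOR: accepts a node when an odd number of predicates accept it. */
    @SafeVarargs
    static <T> Predicate<T> xor(Predicate<T>... predicates) {
        return node -> {
            int matches = 0;
            for (Predicate<T> p : predicates) {
                if (p.test(node)) {
                    matches++;
                }
            }
            return matches % 2 == 1;
        };
    }

    /** NOT: accepts a node when the wrapped predicate rejects it. */
    static <T> Predicate<T> not(Predicate<T> predicate) {
        return node -> !predicate.test(node);
    }

    public static void main(String[] args) {
        // Toy predicates over raw tag text; real filters operate on Node objects.
        Predicate<String> isLink = s -> s.startsWith("<a");
        Predicate<String> hasHref = s -> s.contains("href=");

        String anchor = "<a href=\"x.html\">";
        String named = "<a name=\"top\">";

        System.out.println(and(isLink, hasHref).test(anchor)); // true
        System.out.println(and(isLink, hasHref).test(named));  // false
        System.out.println(xor(isLink, hasHref).test(anchor)); // false (two matches)
        System.out.println(xor(isLink, hasHref).test(named));  // true (one match)
        System.out.println(not(hasHref).test(named));          // true
    }
}
```

Note that with two predicates XOR means "exactly one matches", but with three or more it accepts any odd count, which is why the table entry is worded in terms of an odd number of matches.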