org.htmlparser.sax

Class Locator

public class Locator extends Object implements Locator

Transforms character offsets into line and column in the HTML file.
Field Summary
protected ParsermParser
Underlying parser object.
Constructor Summary
Locator(Parser parser)
Creates a locator for the given parser.
Method Summary
intgetColumnNumber()
Return the column number where the current document event ends.
intgetLineNumber()
Return the line number where the current document event ends.
StringgetPublicId()
Return the public identifier for the current document event.
StringgetSystemId()
Return the system identifier for the current document event.

Field Detail

mParser

protected Parser mParser
Underlying parser object.

Constructor Detail

Locator

public Locator(Parser parser)
Creates a locator for the given parser.

Parameters: parser The parser with the {@link org.htmlparser.lexer.Page Page} being accessed.

Method Detail

getColumnNumber

public int getColumnNumber()
Return the column number where the current document event ends. This is one-based number of Java char values since the last line end.

Warning: The return value from the method is intended only as an approximation for the sake of diagnostics; it is not intended to provide sufficient information to edit the character content of the original XML document. For example, when lines contain combining character sequences, wide characters, surrogate pairs, or bi-directional text, the value may not correspond to the column in a text editor's display.

The return value is an approximation of the column number in the document entity or external parsed entity where the markup triggering the event appears.

If possible, the SAX driver should provide the line position of the first character after the text associated with the document event. The first column in each line is column 1.

Returns: The column number, or -1 if none is available.

See Also: Locator

getLineNumber

public int getLineNumber()
Return the line number where the current document event ends. Lines are delimited by line ends, which are defined in the XML specification.

Warning: The return value from the method is intended only as an approximation for the sake of diagnostics; it is not intended to provide sufficient information to edit the character content of the original XML document. In some cases, these "line" numbers match what would be displayed as columns, and in others they may not match the source text due to internal entity expansion.

The return value is an approximation of the line number in the document entity or external parsed entity where the markup triggering the event appears.

If possible, the SAX driver should provide the line position of the first character after the text associated with the document event. The first line is line 1.

Returns: The line number, or -1 if none is available.

See Also: Locator

getPublicId

public String getPublicId()
Return the public identifier for the current document event.

The return value is the public identifier of the document entity or of the external parsed entity in which the markup triggering the event appears.

Returns: A string containing the public identifier, or null if none is available.

See Also: Locator

getSystemId

public String getSystemId()
Return the system identifier for the current document event.

The return value is the system identifier of the document entity or of the external parsed entity in which the markup triggering the event appears.

If the system identifier is a URL, the parser must resolve it fully before passing it to the application. For example, a file name must always be provided as a file:... URL, and other kinds of relative URI are also resolved against their bases.

Returns: A string containing the system identifier, or null if none is available.

See Also: Locator

HTML Parser is an open source library released under LGPL. SourceForge.net