org.htmlparser.sax

Class Locator

Implemented Interfaces:
Locator

public class Locator
extends Object
implements Locator

Transforms character offsets into line and column in the HTML file.

Field Summary

protected Parser
mParser
Underlying parser object.

Constructor Summary

Locator(Parser parser)
Creates a locator for the given parser.

Method Summary

int
getColumnNumber()
Return the column number where the current document event ends.
int
getLineNumber()
Return the line number where the current document event ends.
String
getPublicId()
Return the public identifier for the current document event.
String
getSystemId()
Return the system identifier for the current document event.

Field Details

mParser

protected Parser mParser
Underlying parser object.

Constructor Details

Locator

public Locator(Parser parser)
Creates a locator for the given parser.
Parameters:
parser - The parser with the Page being accessed.

Method Details

getColumnNumber

public int getColumnNumber()
Return the column number where the current document event ends. This is one-based number of Java char values since the last line end.

Warning: The return value from the method is intended only as an approximation for the sake of diagnostics; it is not intended to provide sufficient information to edit the character content of the original XML document. For example, when lines contain combining character sequences, wide characters, surrogate pairs, or bi-directional text, the value may not correspond to the column in a text editor's display.

The return value is an approximation of the column number in the document entity or external parsed entity where the markup triggering the event appears.

If possible, the SAX driver should provide the line position of the first character after the text associated with the document event. The first column in each line is column 1.

Returns:
The column number, or -1 if none is available.

getLineNumber

public int getLineNumber()
Return the line number where the current document event ends. Lines are delimited by line ends, which are defined in the XML specification.

Warning: The return value from the method is intended only as an approximation for the sake of diagnostics; it is not intended to provide sufficient information to edit the character content of the original XML document. In some cases, these "line" numbers match what would be displayed as columns, and in others they may not match the source text due to internal entity expansion.

The return value is an approximation of the line number in the document entity or external parsed entity where the markup triggering the event appears.

If possible, the SAX driver should provide the line position of the first character after the text associated with the document event. The first line is line 1.

Returns:
The line number, or -1 if none is available.

getPublicId

public String getPublicId()
Return the public identifier for the current document event.

The return value is the public identifier of the document entity or of the external parsed entity in which the markup triggering the event appears.

Returns:
A string containing the public identifier, or null if none is available.

getSystemId

public String getSystemId()
Return the system identifier for the current document event.

The return value is the system identifier of the document entity or of the external parsed entity in which the markup triggering the event appears.

If the system identifier is a URL, the parser must resolve it fully before passing it to the application. For example, a file name must always be provided as a file:... URL, and other kinds of relative URI are also resolved against their bases.

Returns:
A string containing the system identifier, or null if none is available.

HTML Parser is an open source library released under LGPL. SourceForge.net