HTML Parser Home Page | |
Prev Class | Next Class | Frames | No Frames |
Summary: Nested | Field | Method | Constr | Detail: Nested | Field | Method | Constr |
java.lang.Object
org.htmlparser.scanners.TagScanner
org.htmlparser.scanners.CompositeTagScanner
org.htmlparser.scanners.ScriptScanner
public class ScriptScanner
extends CompositeTagScanner
Field Summary | |
static boolean |
|
Constructor Summary | |
|
Methods inherited from class org.htmlparser.scanners.CompositeTagScanner | |
addChild , createVirtualEndTag , finishTag , isTagToBeEndedFor , scan |
Methods inherited from class org.htmlparser.scanners.TagScanner | |
scan |
public static boolean STRICT
Strict parsing of CDATA flag. If this flag is set true, the parsing of script is performed without regard to quotes. This means that erroneous script such as:document.write("</script>");will be parsed in strict accordance with appendix B.3.2 Specifying non-HTML data of the HTML 4.01 Specification and hence will be split into two or more nodes. Correct javascript would escape the ETAGO:document.write("<\/script>");If true, CDATA parsing will stop at the first ETAGO ("</") no matter whether it is quoted or not. If false, balanced quotes (either single or double) will shield an ETAGO. Beacuse of the possibility of quotes within single or multiline comments, these are also parsed. In most cases, users prefer non-strict handling since there is so much broken script out in the wild.
public Tag scan(Tag tag, Lexer lexer, NodeList stack) throws ParserException
Scan for script. Accumulates text from the page, until </[a-zA-Z] is encountered.
- Overrides:
- scan in interface CompositeTagScanner
- Parameters:
tag
- The tag this scanner is responsible for.lexer
- The source of CDATA.stack
- The parse stack, not used.
© 2005 Derrick Oswald Mai 08, 2008 |
HTML Parser is an open source library released under LGPL. | |