org.cyberneko.html
public class HTMLConfiguration extends ParserConfigurationSettings implements XMLPullParserConfiguration
This configuration recognizes a set of features and properties. Their identifiers are held in the String constants listed in the Field Summary below.
For complete usage information, refer to the documentation.
Version: $Id: HTMLConfiguration.java,v 1.9 2005/02/14 03:56:54 andyc Exp $
See Also: HTMLScanner, HTMLTagBalancer
Nested Class Summary | |
---|---|
protected class | HTMLConfiguration.ErrorReporter Defines an error reporter for reporting HTML errors. |
Field Summary | |
---|---|
protected static String | AUGMENTATIONS Include infoset augmentations. |
protected static String | BALANCE_TAGS Balance tags. |
protected static String | ERROR_DOMAIN Error domain. |
protected static String | ERROR_REPORTER Error reporter. |
protected boolean | fCloseStream Stream opened by parser. |
protected XMLDocumentHandler | fDocumentHandler Document handler. |
protected HTMLScanner | fDocumentScanner Document scanner. |
protected XMLDTDContentModelHandler | fDTDContentModelHandler DTD content model handler. |
protected XMLDTDHandler | fDTDHandler DTD handler. |
protected XMLEntityResolver | fEntityResolver Entity resolver. |
protected XMLErrorHandler | fErrorHandler Error handler. |
protected HTMLErrorReporter | fErrorReporter Error reporter. |
protected Vector | fHTMLComponents Components. |
protected Locale | fLocale Locale. |
protected NamespaceBinder | fNamespaceBinder Namespace binder. |
protected HTMLTagBalancer | fTagBalancer HTML tag balancer. |
protected static String | FILTERS Pipeline filters. |
protected static String | NAMESPACES Namespaces. |
protected static String | NAMES_ATTRS Modify HTML attribute names: { "upper", "lower", "default" }. |
protected static String | NAMES_ELEMS Modify HTML element names: { "upper", "lower", "default" }. |
protected static String | REPORT_ERRORS Report errors. |
protected static String | SIMPLE_ERROR_FORMAT Simple report format. |
protected static boolean | XERCES_2_0_0 Parser version is Xerces 2.0.0. |
protected static boolean | XERCES_2_0_1 Parser version is Xerces 2.0.1. |
protected static boolean | XML4J_4_0_x Parser version is XML4J 4.0.x. |
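The NAMES_ELEMS and NAMES_ATTRS entries above accept one of three values: "upper", "lower", or "default". The following is a minimal sketch of what those modes could mean for a single name. It does not call the parser; `NameCaseSketch` and `applyMode` are hypothetical names, and the behavior of "default" (leaving the name as written) is an assumption, since this summary does not spell it out.

```java
import java.util.Locale;

public class NameCaseSketch {

    /**
     * Hypothetical helper: applies one of the documented modes to a name.
     * Assumption: any mode other than "upper" or "lower" (for example
     * "default") leaves the name exactly as it appeared in the source.
     */
    static String applyMode(String name, String mode) {
        if ("upper".equals(mode)) {
            return name.toUpperCase(Locale.ROOT);
        }
        if ("lower".equals(mode)) {
            return name.toLowerCase(Locale.ROOT);
        }
        return name;
    }

    public static void main(String[] args) {
        // Show the effect of each mode on a mixed-case element name.
        for (String mode : new String[] {"upper", "lower", "default"}) {
            System.out.println(mode + " -> " + applyMode("Div", mode));
        }
    }
}
```

With the real configuration, the chosen value would be passed to setProperty(String propertyId, Object value) together with the matching property identifier.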
Constructor Summary | |
---|---|
HTMLConfiguration() Default constructor. |
Method Summary | |
---|---|
protected void | addComponent(HTMLComponent component) Adds a component. |
void | cleanup() If the application decides to terminate parsing before the XML document is fully parsed, it should call this method to free any resources allocated during parsing. |
XMLDocumentHandler | getDocumentHandler() Returns the document handler. |
XMLDTDContentModelHandler | getDTDContentModelHandler() Returns the DTD content model handler. |
XMLDTDHandler | getDTDHandler() Returns the DTD handler. |
XMLEntityResolver | getEntityResolver() Returns the entity resolver. |
XMLErrorHandler | getErrorHandler() Returns the error handler. |
Locale | getLocale() Returns the locale. |
void | parse(XMLInputSource source) Parses a document. |
boolean | parse(boolean complete) Parses the document in a pull parsing fashion. |
void | pushInputSource(XMLInputSource inputSource) Pushes an input source onto the current entity stack. |
protected void | reset() Resets the parser configuration. |
void | setDocumentHandler(XMLDocumentHandler handler) Sets the document handler. |
void | setDTDContentModelHandler(XMLDTDContentModelHandler handler) Sets the DTD content model handler. |
void | setDTDHandler(XMLDTDHandler handler) Sets the DTD handler. |
void | setEntityResolver(XMLEntityResolver resolver) Sets the entity resolver. |
void | setErrorHandler(XMLErrorHandler handler) Sets the error handler. |
void | setFeature(String featureId, boolean state) Sets a feature. |
void | setInputSource(XMLInputSource inputSource) Sets the input source for the document to parse. |
void | setLocale(Locale locale) Sets the locale. |
void | setProperty(String propertyId, Object value) Sets a property. |
parse(boolean complete)
Parameters: complete - True if the pull parser should parse the remaining document completely.
Returns: True if there is more document to parse.
Throws: XNIException - Any XNI exception, possibly wrapping another exception. IOException - An I/O exception from the parser, possibly from a byte stream or character stream supplied by the parser.
See Also: HTMLConfiguration
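The pull-parsing contract described above (call parse(false) repeatedly while it returns true, and call cleanup() when stopping early) can be sketched as follows. This sketch is self-contained and does not use the real parser: `PullParser` is a hypothetical stand-in for the two methods documented here, `ChunkParser` is a toy implementation, and the real parse(boolean) also declares XNIException and IOException, which the stand-in omits.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;

public class PullLoopSketch {

    /** Hypothetical stand-in for the pull-parsing methods of the configuration. */
    interface PullParser {
        /** Returns true if there is more document to parse. */
        boolean parse(boolean complete);

        /** Frees resources when parsing is abandoned early. */
        void cleanup();
    }

    /** Toy parser: each parse(false) call consumes one queued chunk. */
    static class ChunkParser implements PullParser {
        private final Deque<String> chunks;
        boolean cleanedUp = false;

        ChunkParser(String... chunks) {
            this.chunks = new ArrayDeque<>(Arrays.asList(chunks));
        }

        @Override
        public boolean parse(boolean complete) {
            if (complete) {
                chunks.clear();   // parse everything that remains
            } else {
                chunks.poll();    // parse a single step
            }
            return !chunks.isEmpty();
        }

        @Override
        public void cleanup() {
            cleanedUp = true;
        }
    }

    /**
     * Drives the parser one step at a time, for at most maxSteps calls.
     * If the loop stops while more input remains, cleanup() is called.
     * Returns the number of parse calls made.
     */
    static int drive(PullParser parser, int maxSteps) {
        boolean more = true;
        int steps = 0;
        while (more && steps < maxSteps) {
            more = parser.parse(false);
            steps++;
        }
        if (more) {
            parser.cleanup();
        }
        return steps;
    }

    public static void main(String[] args) {
        ChunkParser early = new ChunkParser("a", "b", "c");
        int steps = drive(early, 2);
        System.out.println("steps=" + steps + ", cleanedUp=" + early.cleanedUp);
    }
}
```

With the real class, the same loop would follow a call to setInputSource(XMLInputSource inputSource), and the exceptions listed above would need to be handled.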
pushInputSource(XMLInputSource inputSource)
Hint: To use this feature to insert the output of <SCRIPT> tags, buffer the entire output of the processed instructions before pushing a new input source; otherwise, events may appear out of sequence.
Parameters: inputSource - The new input source to start scanning.
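The buffering advice in the hint above can be illustrated with a small, self-contained sketch: everything a script writes is collected first and handed over in a single push once the script has finished. `InputSourceSink` is a hypothetical stand-in for pushInputSource, and `BufferedScriptOutput` is an illustrative helper, not part of the library.

```java
import java.util.ArrayList;
import java.util.List;

public class ScriptOutputBufferSketch {

    /** Hypothetical stand-in for the configuration's pushInputSource method. */
    interface InputSourceSink {
        void push(String markup);
    }

    /** Collects script output and pushes it once, after the script completes. */
    static class BufferedScriptOutput {
        private final StringBuilder buffer = new StringBuilder();
        private final InputSourceSink sink;

        BufferedScriptOutput(InputSourceSink sink) {
            this.sink = sink;
        }

        /** Called for each fragment the script emits; nothing is pushed yet. */
        void write(String fragment) {
            buffer.append(fragment);
        }

        /** Called when the script is done: push the whole buffer in one go. */
        void finish() {
            if (buffer.length() > 0) {
                sink.push(buffer.toString());
                buffer.setLength(0);
            }
        }
    }

    /** Simulates a script that writes three fragments, then finishes. */
    static List<String> demo() {
        List<String> pushed = new ArrayList<>();
        BufferedScriptOutput out = new BufferedScriptOutput(pushed::add);
        out.write("<b>");
        out.write("hello");
        out.write("</b>");
        out.finish();
        return pushed;
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

In real use, the buffered string would presumably be wrapped in an XMLInputSource backed by a character stream and passed to pushInputSource; that wiring is left out here.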
setInputSource(XMLInputSource inputSource)
Parameters: inputSource - The document's input source.
Throws: XMLConfigurationException - Thrown if there is a configuration error when initializing the parser. IOException - Thrown on I/O error.
See Also: HTMLConfiguration