A B C D E F G H I J K L M N O P R S T U V W X _ 
All Classes All Packages

A

addAttribute(String, String) - Method in class org.htmlcleaner.TagNode
Deprecated.
Use setAttribute instead Adds specified attribute to this tag or overrides existing one.
addAttributesToTag(TagNode, Map) - Method in class org.htmlcleaner.HtmlCleaner
Add attributes from specified map to the specified tag.
addAttributeTransformation(String) - Method in class org.htmlcleaner.TagTransformation
Adds new attribute transformation in which destination attrbute will not exists (simply removes it from list of attributes).
addAttributeTransformation(String, String) - Method in class org.htmlcleaner.TagTransformation
Adds new attribute transformation to this tag transformation.
addChild(Object) - Method in class org.htmlcleaner.TagNode
 
addChildren(List) - Method in class org.htmlcleaner.TagNode
Add all elements from specified list to this node.
addEntity(String, int) - Static method in class org.htmlcleaner.SpecialEntity
Add new entity to the set.
addItemForMoving(BaseToken) - Method in class org.htmlcleaner.TagNode
 
addNamespaceDeclaration(String, String) - Method in class org.htmlcleaner.TagNode
Adds namespace declaration to the node
addPossibleHeadCandidate(TagInfo, TagNode, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Checks if specified tag with specified info is candidate for moving to head section.
addSavedAsContent() - Method in class org.htmlcleaner.HtmlTokenizer
 
addTag(String, int) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
 
addTagInfo(TagInfo) - Method in class org.htmlcleaner.DefaultTagProvider
Sets new tag info.
addText(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
addToken(BaseToken) - Method in class org.htmlcleaner.HtmlTokenizer
 
addTransformation(TagTransformation) - Method in class org.htmlcleaner.CleanerTransformations
Adds specified tag transformation to the collection.
advancedxmlescape - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
advancedXmlEscape - Variable in class org.htmlcleaner.CleanerProperties
 
allowhtmlinsideattributes - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
allowHtmlInsideAttributes - Variable in class org.htmlcleaner.CleanerProperties
 
allowmultiwordattributes - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
allowMultiWordAttributes - Variable in class org.htmlcleaner.CleanerProperties
 
allowsAnything() - Method in class org.htmlcleaner.TagInfo
 
allowsBody() - Method in class org.htmlcleaner.TagInfo
 
allowsItem(BaseToken) - Method in class org.htmlcleaner.TagInfo
 
allTags - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
attName - Variable in class org.htmlcleaner.TagNode.TagNodeAttExistsCondition
 
attName - Variable in class org.htmlcleaner.TagNode.TagNodeAttValueCondition
 
attributes - Variable in class org.htmlcleaner.TagNode
 
attributeTransformations - Variable in class org.htmlcleaner.TagTransformation
 
attributeValue() - Method in class org.htmlcleaner.HtmlTokenizer
Parses a single tag attribute - it is expected to be in one of the forms: name=value name="value" name='value' name
attValue - Variable in class org.htmlcleaner.TagNode.TagNodeAttValueCondition
 

B

BaseToken - Interface in org.htmlcleaner
Base token interface.
belongsTo - Variable in class org.htmlcleaner.TagInfo
 
BODY - Static variable in class org.htmlcleaner.TagInfo
 
bodyNode - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
BOOL_ATT_EMPTY - Static variable in class org.htmlcleaner.CleanerProperties
 
BOOL_ATT_SELF - Static variable in class org.htmlcleaner.CleanerProperties
 
BOOL_ATT_TRUE - Static variable in class org.htmlcleaner.CleanerProperties
 
booleanAttributeValues - Variable in class org.htmlcleaner.CleanerProperties
 
booleanatts - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
BrowserCompactXmlSerializer - Class in org.htmlcleaner
Broswer compact XML serializer - creates resulting XML by stripping whitespaces wherever possible, but preserving single whitespace where at least one exists.
BrowserCompactXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.BrowserCompactXmlSerializer
 

C

calculateRootNode(HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Assigns root node to internal variable.
characters(char[], int, int) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
children - Variable in class org.htmlcleaner.TagNode
 
childTags - Variable in class org.htmlcleaner.TagInfo
 
clean(File) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(File, String) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(InputStream) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(InputStream, String) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(Reader) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(Reader, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Basic version of the cleaning call.
clean(String) - Method in class org.htmlcleaner.DoctypeToken
 
clean(String) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(URL) - Method in class org.htmlcleaner.HtmlCleaner
Creates instance from the content downloaded from specified URL.
clean(URL, String) - Method in class org.htmlcleaner.HtmlCleaner
 
CleanerProperties - Class in org.htmlcleaner
Properties defining cleaner's behaviour
CleanerProperties() - Constructor for class org.htmlcleaner.CleanerProperties
 
CleanerTransformations - Class in org.htmlcleaner
Contains transformation collection.
CleanerTransformations() - Constructor for class org.htmlcleaner.CleanerTransformations
 
CleanTimeValues() - Constructor for class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
closeAll(List<BaseToken>, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Close all unclosed tags if there are any.
closeSnippet(List, HtmlCleaner.TagPos, Object, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
collectNamespacePrefixesOnPath(Set<String>) - Method in class org.htmlcleaner.TagNode
Collect all prefixes in namespace declarations up the path to the document root from the specified node
CommandLine - Class in org.htmlcleaner
Command line usage class.
CommandLine() - Constructor for class org.htmlcleaner.CommandLine
 
comment() - Method in class org.htmlcleaner.HtmlTokenizer
 
CommentNode - Class in org.htmlcleaner
HTML comment token.
CommentNode(String) - Constructor for class org.htmlcleaner.CommentNode
 
commonStr - Variable in class org.htmlcleaner.HtmlTokenizer
 
CompactHtmlSerializer - Class in org.htmlcleaner
Compact HTML serializer - creates resulting HTML by stripping whitespaces wherever possible.
CompactHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.CompactHtmlSerializer
 
CompactXmlSerializer - Class in org.htmlcleaner
Compact XML serializer - creates resulting XML by stripping whitespaces.
CompactXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.CompactXmlSerializer
 
ConfigFileTagProvider - Class in org.htmlcleaner
Default tag provider - reads XML file in specified format and creates tag infos
ConfigFileTagProvider() - Constructor for class org.htmlcleaner.ConfigFileTagProvider
 
ConfigFileTagProvider(File) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
 
ConfigFileTagProvider(URL) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
 
ConfigFileTagProvider(InputSource) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
 
ConfigFileTagProvider.ConfigParser - Class in org.htmlcleaner
SAX parser for tag configuration files.
ConfigParser(Map) - Constructor for class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
content - Variable in class org.htmlcleaner.CommentNode
 
content - Variable in class org.htmlcleaner.ContentNode
 
content() - Method in class org.htmlcleaner.HtmlTokenizer
 
CONTENT_ALL - Static variable in class org.htmlcleaner.TagInfo
 
CONTENT_NONE - Static variable in class org.htmlcleaner.TagInfo
 
CONTENT_TEXT - Static variable in class org.htmlcleaner.TagInfo
 
ContentNode - Class in org.htmlcleaner
HTML text token.
ContentNode(char[], int) - Constructor for class org.htmlcleaner.ContentNode
 
ContentNode(String) - Constructor for class org.htmlcleaner.ContentNode
 
contentType - Variable in class org.htmlcleaner.TagInfo
 
continueAfterTags - Variable in class org.htmlcleaner.TagInfo
 
copyTags - Variable in class org.htmlcleaner.TagInfo
 
createDocumentNodes(List, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
createDOM(TagNode) - Method in class org.htmlcleaner.DomSerializer
 
createElement(TagNode) - Method in class org.htmlcleaner.JDomSerializer
 
createElement(TagNode, Document) - Method in class org.htmlcleaner.DomSerializer
 
createJDom(TagNode) - Method in class org.htmlcleaner.JDomSerializer
 
createSubnodes(Element, List) - Method in class org.htmlcleaner.JDomSerializer
 
createSubnodes(Document, Element, List) - Method in class org.htmlcleaner.DomSerializer
 
createTagNode(String) - Method in class org.htmlcleaner.HtmlTokenizer
 
createTagNode(String, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
createTagNode(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
 

D

DEFAULT_CHARSET - Static variable in class org.htmlcleaner.HtmlCleaner
 
DEFAULT_INDENTATION_STRING - Static variable in class org.htmlcleaner.PrettyHtmlSerializer
 
DEFAULT_INDENTATION_STRING - Static variable in class org.htmlcleaner.PrettyXmlSerializer
 
DefaultTagProvider - Class in org.htmlcleaner
This class is automatically created from ConfigFileTagProvider which reads default XML configuration file with tag descriptions.
DefaultTagProvider() - Constructor for class org.htmlcleaner.DefaultTagProvider
 
defineAllowedChildrenTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineCloseBeforeCopyInsideTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineCloseBeforeTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineCloseInsideCopyAfterTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineFatalTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineForbiddenTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineHigherLevelTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineNamespaceDeclarations(TagNode, Element) - Method in class org.htmlcleaner.JDomSerializer
 
defineRequiredEnclosingTags(String) - Method in class org.htmlcleaner.TagInfo
 
dependencyName - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
deprecated - Variable in class org.htmlcleaner.TagInfo
 
dest - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
destTag - Variable in class org.htmlcleaner.TagTransformation
 
doctype() - Method in class org.htmlcleaner.HtmlTokenizer
 
docType - Variable in class org.htmlcleaner.TagNode
 
DoctypeToken - Class in org.htmlcleaner
HTML doctype token.
DoctypeToken(String, String, String, String) - Constructor for class org.htmlcleaner.DoctypeToken
 
DomSerializer - Class in org.htmlcleaner
DOM serializer - creates xml DOM.
DomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.DomSerializer
 
DomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.DomSerializer
 
dontEscape(TagNode) - Method in class org.htmlcleaner.HtmlSerializer
 
dontEscape(TagNode) - Method in class org.htmlcleaner.XmlSerializer
 

E

endElement(String, String, String) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
EndTagToken - Class in org.htmlcleaner
HTML tag end token.
EndTagToken() - Constructor for class org.htmlcleaner.EndTagToken
 
EndTagToken(String) - Constructor for class org.htmlcleaner.EndTagToken
 
entities - Static variable in class org.htmlcleaner.SpecialEntity
 
escapeText(String) - Method in class org.htmlcleaner.HtmlSerializer
 
escapeXml - Variable in class org.htmlcleaner.DomSerializer
 
escapeXml - Variable in class org.htmlcleaner.JDomSerializer
 
escapeXml(String) - Method in class org.htmlcleaner.XmlSerializer
 
escapeXml(String, CleanerProperties, boolean) - Static method in class org.htmlcleaner.Utils
Escapes XML string.
evaluateAgainst(Collection, int, int, boolean, int, int, boolean, Collection) - Method in class org.htmlcleaner.XPather
 
evaluateAgainstNode(TagNode) - Method in class org.htmlcleaner.XPather
Main public method for this class - a way to execute XPath expression against specified TagNode instance.
evaluateFunction(Collection, int, int, int, int, boolean) - Method in class org.htmlcleaner.XPather
Evaluates specified function.
evaluateLogic(Collection, Collection, String) - Method in class org.htmlcleaner.XPather
Evaluates logic operation on two collections.
evaluateTemplate(String, Map) - Static method in class org.htmlcleaner.Utils
Evaluates string template for specified map of variables.
evaluateXPath(String) - Method in class org.htmlcleaner.TagNode
Evaluates XPath expression on give node.
execute() - Method in class org.htmlcleaner.HtmlCleanerForAnt
Implementation of Ant task execution.

F

factory - Variable in class org.htmlcleaner.JDomSerializer
 
fatalTag - Variable in class org.htmlcleaner.TagInfo
 
filterByCondition(Collection, int, int) - Method in class org.htmlcleaner.XPather
Filter nodes satisfying the condition
findClosingIndex(int, int) - Method in class org.htmlcleaner.XPather
 
findElement(TagNode.ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
Finds first element in the tree that satisfy specified condition.
findElementByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
 
findElementByName(String, boolean) - Method in class org.htmlcleaner.TagNode
 
findElementHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
 
findFirstTagPos() - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
 
findTag(String) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
 
findTagToPlaceRubbish() - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
 
flatten(int, int) - Method in class org.htmlcleaner.XPather
 
fullUrl(String, String) - Static method in class org.htmlcleaner.Utils
Calculates full URL for specified page URL and link which could be full, absolute or relative like there can be found in A or IMG tags.

G

generateCode - Variable in class org.htmlcleaner.ConfigFileTagProvider
 
getAllElements(boolean) - Method in class org.htmlcleaner.TagNode
 
getAllElementsList(boolean) - Method in class org.htmlcleaner.TagNode
 
getArgValue(String[], String) - Static method in class org.htmlcleaner.CommandLine
 
getAsString(TagNode) - Method in class org.htmlcleaner.Serializer
 
getAsString(TagNode, boolean) - Method in class org.htmlcleaner.Serializer
 
getAsString(TagNode, String) - Method in class org.htmlcleaner.Serializer
 
getAsString(TagNode, String, boolean) - Method in class org.htmlcleaner.Serializer
 
getAttributeByName(String) - Method in class org.htmlcleaner.TagNode
 
getAttributes() - Method in class org.htmlcleaner.TagNode
 
getAttributeTransformations() - Method in class org.htmlcleaner.TagTransformation
 
getBelongsTo() - Method in class org.htmlcleaner.TagInfo
 
getBooleanAttributeValues() - Method in class org.htmlcleaner.CleanerProperties
 
getCharacter() - Method in class org.htmlcleaner.SpecialEntity
 
getCharsetFromContent(URL) - Static method in class org.htmlcleaner.Utils
 
getCharsetFromContentTypeString(String) - Static method in class org.htmlcleaner.Utils
 
getChildIndex(HtmlNode) - Method in class org.htmlcleaner.TagNode
 
getChildren() - Method in class org.htmlcleaner.TagNode
 
getChildTagList() - Method in class org.htmlcleaner.TagNode
 
getChildTags() - Method in class org.htmlcleaner.TagInfo
 
getChildTags() - Method in class org.htmlcleaner.TagNode
 
getCommentedContent() - Method in class org.htmlcleaner.CommentNode
 
getContent() - Method in class org.htmlcleaner.CommentNode
 
getContent() - Method in class org.htmlcleaner.ContentNode
 
getContent() - Method in class org.htmlcleaner.DoctypeToken
 
getContentType() - Method in class org.htmlcleaner.TagInfo
 
getContinueAfterTags() - Method in class org.htmlcleaner.TagInfo
 
getCopyTags() - Method in class org.htmlcleaner.TagInfo
 
getCurrentChar() - Method in class org.htmlcleaner.HtmlTokenizer
 
getDecimalNCR() - Method in class org.htmlcleaner.SpecialEntity
 
getDestTag() - Method in class org.htmlcleaner.TagTransformation
 
getDocType() - Method in class org.htmlcleaner.HtmlTokenizer
 
getDocType() - Method in class org.htmlcleaner.TagNode
 
getElementList(TagNode.ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
Get all elements in the tree that satisfy specified condition.
getElementListByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementListByName(String, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementListHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
 
getElements(TagNode.ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementsByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementsByName(String, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementsByName(Collection, int, int, boolean, boolean) - Method in class org.htmlcleaner.XPather
For the given source collection and specified name, returns collection of subnodes or attribute values.
getElementsHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
 
getEntity(String) - Static method in class org.htmlcleaner.SpecialEntity
 
getEscapedValue() - Method in class org.htmlcleaner.SpecialEntity
 
getFatalTag() - Method in class org.htmlcleaner.TagInfo
 
getHexNCR() - Method in class org.htmlcleaner.SpecialEntity
 
getHigherTags() - Method in class org.htmlcleaner.TagInfo
 
getHyphenReplacementInComment() - Method in class org.htmlcleaner.CleanerProperties
 
getIndent(int) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
getIndent(int) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
getIndentedText(String, int) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
getIndentedText(String, int) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
getInnerHtml(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
For the specified node, returns it's content as string.
getInstance() - Static method in class org.htmlcleaner.DefaultTagProvider
 
getIntCode() - Method in class org.htmlcleaner.SpecialEntity
 
getItemsToMove() - Method in class org.htmlcleaner.TagNode
 
getKey() - Method in class org.htmlcleaner.SpecialEntity
 
getLastTagPos() - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
 
getMaxEntityLength() - Static method in class org.htmlcleaner.SpecialEntity
 
getMustCloseTags() - Method in class org.htmlcleaner.TagInfo
 
getName() - Method in class org.htmlcleaner.DoctypeToken
 
getName() - Method in class org.htmlcleaner.TagInfo
 
getName() - Method in class org.htmlcleaner.TagToken
 
getNamespaceDeclarations() - Method in class org.htmlcleaner.TagNode
 
getNamespaceURIOnPath(String) - Method in class org.htmlcleaner.TagNode
 
getParent() - Method in class org.htmlcleaner.TagNode
 
getPart1() - Method in class org.htmlcleaner.DoctypeToken
 
getPart2() - Method in class org.htmlcleaner.DoctypeToken
 
getPart3() - Method in class org.htmlcleaner.DoctypeToken
 
getPart4() - Method in class org.htmlcleaner.DoctypeToken
 
getPermittedTags() - Method in class org.htmlcleaner.TagInfo
 
getProperties() - Method in class org.htmlcleaner.HtmlCleaner
 
getPruneTags() - Method in class org.htmlcleaner.CleanerProperties
 
getRequiredParent() - Method in class org.htmlcleaner.TagInfo
 
getSingleLineOfChildren(List) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
getSingleLineOfChildren(List) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
getSourceTag() - Method in class org.htmlcleaner.TagTransformation
 
getTagInfo(String) - Method in class org.htmlcleaner.ConfigFileTagProvider
 
getTagInfo(String) - Method in class org.htmlcleaner.DefaultTagProvider
 
getTagInfo(String) - Method in interface org.htmlcleaner.ITagInfoProvider
 
getTagInfoProvider() - Method in class org.htmlcleaner.CleanerProperties
 
getTagInfoProvider() - Method in class org.htmlcleaner.HtmlCleaner
 
getText() - Method in class org.htmlcleaner.TagNode
 
getTokenList() - Method in class org.htmlcleaner.HtmlTokenizer
 
getTransformation(String) - Method in class org.htmlcleaner.CleanerTransformations
 
getTransformations() - Method in class org.htmlcleaner.HtmlCleaner
 
getXmlAsString(TagNode) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use getAsString() instead.
getXmlAsString(TagNode, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use getAsString() instead.
getXmlName(String) - Static method in class org.htmlcleaner.Utils
 
getXmlNSPrefix(String) - Static method in class org.htmlcleaner.Utils
 
go() - Method in class org.htmlcleaner.HtmlTokenizer
 
go(int) - Method in class org.htmlcleaner.HtmlTokenizer
 

H

hasAttribute(String) - Method in class org.htmlcleaner.TagNode
Checks existance of specified attribute.
hasAttributeTransformations() - Method in class org.htmlcleaner.TagTransformation
 
hasChildren() - Method in class org.htmlcleaner.TagNode
 
hasCopyTags() - Method in class org.htmlcleaner.TagInfo
 
hasPermittedTags() - Method in class org.htmlcleaner.TagInfo
 
hasTransformationForTag(String) - Method in class org.htmlcleaner.CleanerTransformations
 
HEAD - Static variable in class org.htmlcleaner.TagInfo
 
HEAD_AND_BODY - Static variable in class org.htmlcleaner.TagInfo
 
HeadlessTagNode(TagNode) - Constructor for class org.htmlcleaner.Serializer.HeadlessTagNode
 
headNode - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
higherTags - Variable in class org.htmlcleaner.TagInfo
 
HtmlCleaner - Class in org.htmlcleaner
Main HtmlCleaner class.
HtmlCleaner() - Constructor for class org.htmlcleaner.HtmlCleaner
Constructor - creates cleaner instance with default tag info provider and default properties.
HtmlCleaner(CleanerProperties) - Constructor for class org.htmlcleaner.HtmlCleaner
Constructor - creates the instance with default tag info provider and specified properties
HtmlCleaner(ITagInfoProvider) - Constructor for class org.htmlcleaner.HtmlCleaner
Constructor - creates the instance with specified tag info provider and default properties
HtmlCleaner(ITagInfoProvider, CleanerProperties) - Constructor for class org.htmlcleaner.HtmlCleaner
Constructor - creates the instance with specified tag info provider and specified properties
HtmlCleaner.CleanTimeValues - Class in org.htmlcleaner
 
HtmlCleaner.OpenTags - Class in org.htmlcleaner
Class that contains information and mathods for managing list of open, but unhandled tags.
HtmlCleaner.TagPos - Class in org.htmlcleaner
Contains information about single open tag
HtmlCleanerException - Exception in org.htmlcleaner
General HtmlCleaner runtime exception.
HtmlCleanerException() - Constructor for exception org.htmlcleaner.HtmlCleanerException
 
HtmlCleanerException(String) - Constructor for exception org.htmlcleaner.HtmlCleanerException
 
HtmlCleanerException(String, Throwable) - Constructor for exception org.htmlcleaner.HtmlCleanerException
 
HtmlCleanerException(Throwable) - Constructor for exception org.htmlcleaner.HtmlCleanerException
 
HtmlCleanerForAnt - Class in org.htmlcleaner
Support for ANT.
HtmlCleanerForAnt() - Constructor for class org.htmlcleaner.HtmlCleanerForAnt
 
htmlNode - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
HtmlNode - Interface in org.htmlcleaner
Marker interface denoting nodes of the document tree
HtmlSerializer - Class in org.htmlcleaner
Abstract HTML serializer - contains common logic for descendants.
HtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.HtmlSerializer
 
HtmlTokenizer - Class in org.htmlcleaner
Main HTML tokenizer.
HtmlTokenizer(Reader, CleanerProperties, CleanerTransformations, ITagInfoProvider) - Constructor for class org.htmlcleaner.HtmlTokenizer
Constructor - cretes instance of the parser with specified content.
hyphenreplacement - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
hyphenReplacementInComment - Variable in class org.htmlcleaner.CleanerProperties
 

I

identifier() - Method in class org.htmlcleaner.HtmlTokenizer
Parses an identifier from the current position.
ignorePermitted - Variable in class org.htmlcleaner.TagInfo
 
ignoreqe - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
ignoreQuestAndExclam - Variable in class org.htmlcleaner.CleanerProperties
 
ignoreUntil(char) - Method in class org.htmlcleaner.HtmlTokenizer
 
incharset - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
indents - Variable in class org.htmlcleaner.PrettyHtmlSerializer
 
indents - Variable in class org.htmlcleaner.PrettyXmlSerializer
 
indentString - Variable in class org.htmlcleaner.PrettyHtmlSerializer
 
indentString - Variable in class org.htmlcleaner.PrettyXmlSerializer
 
info - Variable in class org.htmlcleaner.HtmlCleaner.TagPos
 
insertChild(int, HtmlNode) - Method in class org.htmlcleaner.TagNode
Inserts specified node at specified position in array of children
insertChildAfter(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
Inserts specified node in the list of children after specified child
insertChildBefore(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
Inserts specified node in the list of children before specified child
intCode - Variable in class org.htmlcleaner.SpecialEntity
 
isAdvancedXmlEscape() - Method in class org.htmlcleaner.CleanerProperties
 
isAllowedInLastOpenTag(BaseToken, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
isAllowHtmlInsideAttributes - Variable in class org.htmlcleaner.HtmlTokenizer
 
isAllowHtmlInsideAttributes() - Method in class org.htmlcleaner.CleanerProperties
 
isAllowMultiWordAttributes - Variable in class org.htmlcleaner.HtmlTokenizer
 
isAllowMultiWordAttributes() - Method in class org.htmlcleaner.CleanerProperties
 
isAllRead() - Method in class org.htmlcleaner.HtmlTokenizer
Checks if end of the content is reached.
isAtt(String) - Method in class org.htmlcleaner.XPather
Checks if token is attribute (starts with @)
isCaseSensitive - Variable in class org.htmlcleaner.TagNode.TagNodeAttValueCondition
 
isChar(char) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at current runtime position is equal to specified char.
isChar(int, char) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at specified position is equal to specified char.
isCharEquals(char) - Method in class org.htmlcleaner.HtmlTokenizer
 
isCharSimple(char) - Method in class org.htmlcleaner.HtmlTokenizer
 
isContinueAfter(String) - Method in class org.htmlcleaner.TagInfo
 
isCopy(String) - Method in class org.htmlcleaner.TagInfo
 
isDeprecated() - Method in class org.htmlcleaner.TagInfo
 
isEmpty() - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
 
isEmptyString(Object) - Static method in class org.htmlcleaner.Utils
 
isEmptyTag() - Method in class org.htmlcleaner.TagInfo
 
isFatalTagSatisfied(TagInfo, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Checks if open fatal tag is missing if there is a fatal tag for the specified tag.
isFormed - Variable in class org.htmlcleaner.TagNode
 
isFormed() - Method in class org.htmlcleaner.TagNode
 
isFullUrl(String) - Static method in class org.htmlcleaner.Utils
Checks if specified link is full URL.
isFunctionCall(int, int) - Method in class org.htmlcleaner.XPather
Checks if tokens in specified range represents valid function call.
isHeadAndBodyTag() - Method in class org.htmlcleaner.TagInfo
 
isHeadTag() - Method in class org.htmlcleaner.TagInfo
 
isHexadecimalDigit(char) - Static method in class org.htmlcleaner.Utils
 
isHigher(String) - Method in class org.htmlcleaner.TagInfo
 
isIdentifier(String) - Method in class org.htmlcleaner.XPather
Checks if given string is valid identifier.
isIdentifierChar() - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at current runtime position can be identifier part.
isIdentifierHelperChar(char) - Static method in class org.htmlcleaner.Utils
Checks if specified character can be part of xml identifier (tag name of attribute name) and is not standard identifier character.
isIdentifierStartChar() - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at current runtime position can be identifier start.
isIdentifierStartChar(int) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at specified position can be identifier start.
isIgnorePermitted() - Method in class org.htmlcleaner.TagInfo
 
isIgnoreQuestAndExclam() - Method in class org.htmlcleaner.CleanerProperties
 
isMinimizedTagSyntax(TagNode) - Method in class org.htmlcleaner.HtmlSerializer
 
isMinimizedTagSyntax(TagNode) - Method in class org.htmlcleaner.XmlSerializer
 
isMustCloseTag(TagInfo) - Method in class org.htmlcleaner.TagInfo
 
isNamespacesAware - Variable in class org.htmlcleaner.HtmlTokenizer
 
isNamespacesAware() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitComments - Variable in class org.htmlcleaner.HtmlTokenizer
 
isOmitComments() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitDeprecatedTags - Variable in class org.htmlcleaner.HtmlTokenizer
 
isOmitDeprecatedTags() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitDoctypeDeclaration() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitHtmlEnvelope() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitUnknownTags - Variable in class org.htmlcleaner.HtmlTokenizer
 
isOmitUnknownTags() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitXmlDeclaration() - Method in class org.htmlcleaner.CleanerProperties
 
isPreserveSourceAttributes() - Method in class org.htmlcleaner.TagTransformation
 
isRecognizeUnicodeChars() - Method in class org.htmlcleaner.CleanerProperties
 
isReservedTag(String) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if specified tag name is one of the reserved tags: HTML, HEAD or BODY
isReservedXmlChar(char) - Static method in class org.htmlcleaner.Utils
 
isScriptOrStyle(TagNode) - Method in class org.htmlcleaner.Serializer
 
isStartToken(Object) - Method in class org.htmlcleaner.HtmlCleaner
 
isToken(String, int) - Method in class org.htmlcleaner.XPather
 
isTranslateSpecialEntities() - Method in class org.htmlcleaner.CleanerProperties
 
isTransResCharsToNCR() - Method in class org.htmlcleaner.CleanerProperties
 
isTransSpecialEntitiesToNCR() - Method in class org.htmlcleaner.CleanerProperties
 
isTreatDeprecatedTagsAsContent - Variable in class org.htmlcleaner.HtmlTokenizer
 
isTreatDeprecatedTagsAsContent() - Method in class org.htmlcleaner.CleanerProperties
 
isTreatUnknownTagsAsContent - Variable in class org.htmlcleaner.HtmlTokenizer
 
isTreatUnknownTagsAsContent() - Method in class org.htmlcleaner.CleanerProperties
 
isUnique() - Method in class org.htmlcleaner.TagInfo
 
isUseCdataForScriptAndStyle() - Method in class org.htmlcleaner.CleanerProperties
 
isUseEmptyElementTags() - Method in class org.htmlcleaner.CleanerProperties
 
isValid() - Method in class org.htmlcleaner.DoctypeToken
 
isValidDouble(String) - Method in class org.htmlcleaner.XPather
 
isValidInt(String, int) - Static method in class org.htmlcleaner.Utils
 
isValidInteger(String) - Method in class org.htmlcleaner.XPather
 
isValidXmlChar() - Method in class org.htmlcleaner.HtmlTokenizer
 
isValidXmlChar(char) - Static method in class org.htmlcleaner.Utils
 
isValidXmlCharSafe() - Method in class org.htmlcleaner.HtmlTokenizer
 
isValidXmlIdentifier(String) - Static method in class org.htmlcleaner.Utils
Chacks whether specified string can be valid tag name or attribute name in xml.
isWhitespace() - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at current runtime position is whitespace.
isWhitespace(int) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at specified position is whitespace.
isWhitespaceSafe() - Method in class org.htmlcleaner.HtmlTokenizer
 
isWhitespaceString(Object) - Static method in class org.htmlcleaner.Utils
Checks whether specified object's string representation is empty string (containing of only whitespaces).
ITagInfoProvider - Interface in org.htmlcleaner
Provides set of TagInfo instances.
itemsToMove - Variable in class org.htmlcleaner.TagNode
 

J

JDomSerializer - Class in org.htmlcleaner
JDom serializer - creates xml JDom instance out of the TagNode.
JDomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.JDomSerializer
 
JDomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.JDomSerializer
 

K

key - Variable in class org.htmlcleaner.SpecialEntity
 

L

last - Variable in class org.htmlcleaner.HtmlCleaner.OpenTags
 
list - Variable in class org.htmlcleaner.HtmlCleaner.OpenTags
 
ltrim(String) - Static method in class org.htmlcleaner.Utils
Trims specified string from left.

M

main(String[]) - Static method in class org.htmlcleaner.CommandLine
 
main(String[]) - Static method in class org.htmlcleaner.ConfigFileTagProvider
Generates code for tag provider class from specified configuration XML file.
makeCopy() - Method in class org.htmlcleaner.TagNode
 
makeTagNodeCopy(TagNode, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
makeTree(List<BaseToken>) - Method in class org.htmlcleaner.HtmlTokenizer
 
makeTree(List<BaseToken>, ListIterator<BaseToken>, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
mappings - Variable in class org.htmlcleaner.CleanerTransformations
 
maxEntityLength - Static variable in class org.htmlcleaner.SpecialEntity
 
mustAddRequiredParent(TagInfo, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Check if specified tag requires parent tag, but that parent tag is missing in the appropriate context.
mustCloseTags - Variable in class org.htmlcleaner.TagInfo
 

N

name - Variable in class org.htmlcleaner.HtmlCleaner.TagPos
 
name - Variable in class org.htmlcleaner.TagInfo
 
name - Variable in class org.htmlcleaner.TagNode.TagNodeNameCondition
 
name - Variable in class org.htmlcleaner.TagToken
 
namespacesaware - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
namespacesAware - Variable in class org.htmlcleaner.CleanerProperties
 
nodebyxpath - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
nsDeclarations - Variable in class org.htmlcleaner.TagNode
 

O

omitcomments - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitComments - Variable in class org.htmlcleaner.CleanerProperties
 
omitDeprecatedTags - Variable in class org.htmlcleaner.CleanerProperties
 
omitdeprtags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitdoctypedecl - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitDoctypeDeclaration - Variable in class org.htmlcleaner.CleanerProperties
 
omitenvelope - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omithtmlenvelope - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitHtmlEnvelope - Variable in class org.htmlcleaner.CleanerProperties
 
omitunknowntags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitUnknownTags - Variable in class org.htmlcleaner.CleanerProperties
 
omitxmldecl - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitXmlDeclaration - Variable in class org.htmlcleaner.CleanerProperties
 
openPreTags - Variable in class org.htmlcleaner.CompactHtmlSerializer
 
OpenTags() - Constructor for class org.htmlcleaner.HtmlCleaner.OpenTags
 
org.htmlcleaner - package org.htmlcleaner
 
outcharset - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
outputtype - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 

P

parent - Variable in class org.htmlcleaner.TagNode
 
parse(InputSource) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
parserFactory - Static variable in class org.htmlcleaner.ConfigFileTagProvider
 
part1 - Variable in class org.htmlcleaner.DoctypeToken
 
part2 - Variable in class org.htmlcleaner.DoctypeToken
 
part3 - Variable in class org.htmlcleaner.DoctypeToken
 
part4 - Variable in class org.htmlcleaner.DoctypeToken
 
permittedTags - Variable in class org.htmlcleaner.TagInfo
 
position - Variable in class org.htmlcleaner.HtmlCleaner.TagPos
 
preserveSourceAttributes - Variable in class org.htmlcleaner.TagTransformation
 
PrettyHtmlSerializer - Class in org.htmlcleaner
Pretty HTML serializer - creates resulting HTML with indenting lines.
PrettyHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.PrettyHtmlSerializer
 
PrettyHtmlSerializer(CleanerProperties, String) - Constructor for class org.htmlcleaner.PrettyHtmlSerializer
 
PrettyXmlSerializer - Class in org.htmlcleaner
Pretty XML serializer - creates resulting XML with indenting lines.
PrettyXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.PrettyXmlSerializer
 
PrettyXmlSerializer(CleanerProperties, String) - Constructor for class org.htmlcleaner.PrettyXmlSerializer
 
properties - Variable in class org.htmlcleaner.HtmlCleaner
 
props - Variable in class org.htmlcleaner.DomSerializer
 
props - Variable in class org.htmlcleaner.HtmlTokenizer
 
props - Variable in class org.htmlcleaner.JDomSerializer
 
props - Variable in class org.htmlcleaner.Serializer
 
pruneNodeSet - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
prunetags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
pruneTags - Variable in class org.htmlcleaner.CleanerProperties
 
pruneTagSet - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 

R

readIfNeeded(int) - Method in class org.htmlcleaner.HtmlTokenizer
 
recognizeUnicodeChars - Variable in class org.htmlcleaner.CleanerProperties
 
removeAllChildren() - Method in class org.htmlcleaner.TagNode
Removes all children (subelements and text content).
removeAttribute(String) - Method in class org.htmlcleaner.TagNode
Removes specified attribute from this tag.
removeChild(Object) - Method in class org.htmlcleaner.TagNode
Remove specified child element from this node.
removeFromTree() - Method in class org.htmlcleaner.TagNode
Remove this node from the tree.
removeTag(String) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
 
removeTagInfo(String) - Method in class org.htmlcleaner.DefaultTagProvider
Removes tag info with specified name.
replaceChild(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
Replaces specified child node with specified replacement node.
requiredParent - Variable in class org.htmlcleaner.TagInfo
 
RESERVED_XML_CHARS - Static variable in class org.htmlcleaner.Utils
 
rootNode - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
rtrim(String) - Static method in class org.htmlcleaner.Utils
Trims specified string from right.

S

satisfy(TagNode) - Method in interface org.htmlcleaner.TagNode.ITagNodeCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.TagNode.TagAllCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.TagNode.TagNodeAttExistsCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.TagNode.TagNodeAttValueCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.TagNode.TagNodeNameCondition
 
save(char) - Method in class org.htmlcleaner.HtmlTokenizer
Saves specified character to the temporary buffer.
saveCurrent() - Method in class org.htmlcleaner.HtmlTokenizer
Saves character at current runtime position to the temporary buffer.
saveCurrent(int) - Method in class org.htmlcleaner.HtmlTokenizer
Saves specified number of characters at current runtime position to the temporary buffer.
saveCurrentSafe() - Method in class org.htmlcleaner.HtmlTokenizer
 
saveToLastOpenTag(List, BaseToken, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
serialize(Serializer, Writer) - Method in interface org.htmlcleaner.BaseToken
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.CommentNode
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.ContentNode
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.DoctypeToken
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.EndTagToken
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.TagNode
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.BrowserCompactXmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.CompactHtmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.CompactXmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.Serializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.SimpleHtmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.SimpleXmlSerializer
 
serializeEndTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.HtmlSerializer
 
serializeEndTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.XmlSerializer
 
serializeOpenTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.HtmlSerializer
 
serializeOpenTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.XmlSerializer
 
serializePrettyHtml(TagNode, Writer, int, boolean, boolean) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
serializePrettyXml(TagNode, Writer, int) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
Serializer - Class in org.htmlcleaner
Basic abstract serializer - contains common logic for descendants (methods writeXXX().
Serializer(CleanerProperties) - Constructor for class org.htmlcleaner.Serializer
 
Serializer.HeadlessTagNode - Class in org.htmlcleaner
Used to implement serialization with missing envelope - omiting open and close tags, just serialize children.
set - Variable in class org.htmlcleaner.HtmlCleaner.OpenTags
 
setAdvancedxmlescape(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setAdvancedXmlEscape(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setAllowhtmlinsideattributes(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setAllowHtmlInsideAttributes(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setAllowmultiwordattributes(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setAllowMultiWordAttributes(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setAttribute(String, String) - Method in class org.htmlcleaner.EndTagToken
 
setAttribute(String, String) - Method in class org.htmlcleaner.TagNode
Adding new attribute ir overriding existing one.
setAttribute(String, String) - Method in class org.htmlcleaner.TagToken
 
setAttributes(TagNode, Element) - Method in class org.htmlcleaner.JDomSerializer
 
setAttributes(TagNode, Element) - Method in class org.htmlcleaner.DomSerializer
 
setBelongsTo(int) - Method in class org.htmlcleaner.TagInfo
 
setBooleanAttributeValues(String) - Method in class org.htmlcleaner.CleanerProperties
 
setBooleanatts(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setChildren(List) - Method in class org.htmlcleaner.TagNode
 
setChildTags(Set) - Method in class org.htmlcleaner.TagInfo
 
setContinueAfterTags(Set) - Method in class org.htmlcleaner.TagInfo
 
setCopyTags(Set) - Method in class org.htmlcleaner.TagInfo
 
setDeprecated(boolean) - Method in class org.htmlcleaner.TagInfo
 
setDest(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setDocType(DoctypeToken) - Method in class org.htmlcleaner.TagNode
 
setFatalTag(String) - Method in class org.htmlcleaner.TagInfo
 
setFormed() - Method in class org.htmlcleaner.TagNode
 
setFormed(boolean) - Method in class org.htmlcleaner.TagNode
 
setHigherTags(Set) - Method in class org.htmlcleaner.TagInfo
 
setHyphenreplacement(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setHyphenReplacementInComment(String) - Method in class org.htmlcleaner.CleanerProperties
 
setIgnorePermitted(boolean) - Method in class org.htmlcleaner.TagInfo
 
setIgnoreqe(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setIgnoreQuestAndExclam(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setIncharset(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setInnerHtml(TagNode, String) - Method in class org.htmlcleaner.HtmlCleaner
For the specified tag node, defines it's html content.
setItemsToMove(List<BaseToken>) - Method in class org.htmlcleaner.TagNode
 
setMustCloseTags(Set) - Method in class org.htmlcleaner.TagInfo
 
setName(String) - Method in class org.htmlcleaner.TagInfo
 
setName(String) - Method in class org.htmlcleaner.TagNode
Changes name of the tag
setNamespacesaware(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setNamespacesAware(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setNodebyxpath(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitcomments(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitComments(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitDeprecatedTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitdeprtags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitdoctypedecl(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitDoctypeDeclaration(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitenvelope(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmithtmlenvelope(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitHtmlEnvelope(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitunknowntags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitUnknownTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitxmldecl(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitXmlDeclaration(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOutcharset(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOutputtype(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setPermittedTags(Set) - Method in class org.htmlcleaner.TagInfo
 
setPrunetags(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setPruneTags(String) - Method in class org.htmlcleaner.CleanerProperties
 
setPruneTags(String, HtmlCleaner.CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
setRecognizeUnicodeChars(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setRequiredParent(String) - Method in class org.htmlcleaner.TagInfo
 
setSpecialentities(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setSrc(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTaginfofile(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setText(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTransform(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTransformations(CleanerTransformations) - Method in class org.htmlcleaner.HtmlCleaner
Sets tranformations for this cleaner instance.
setTranslateSpecialEntities(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setTransrescharstoncr(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTransResCharsToNCR(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setTransspecialentitiestoncr(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTransSpecialEntitiesToNCR(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setTreatDeprecatedTagsAsContent(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setTreatdeprtagsascontent(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTreatunknowntagsascontent(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTreatUnknownTagsAsContent(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setUnicodechars(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setUnique(boolean) - Method in class org.htmlcleaner.TagInfo
 
setUsecdata(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setUseCdataForScriptAndStyle(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setUseemptyelementtags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setUseEmptyElementTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
SimpleHtmlSerializer - Class in org.htmlcleaner
Simple HTML serializer - creates resulting HTML without indenting and/or compacting.
SimpleHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.SimpleHtmlSerializer
 
SimpleXmlSerializer - Class in org.htmlcleaner
Simple XML serializer - creates resulting XML without indenting lines.
SimpleXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.SimpleXmlSerializer
 
singleton(Object) - Method in class org.htmlcleaner.XPather
Creates one-element collection for the specified object.
skipWhitespaces() - Method in class org.htmlcleaner.HtmlTokenizer
Skips whitespaces at current position and moves foreward until non-whitespace character is found or the end of content is reached.
someAlreadyOpen(Set) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
Checks if any of tags specified in the set are already open.
sourceTag - Variable in class org.htmlcleaner.TagTransformation
 
specialentities - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
SpecialEntity - Class in org.htmlcleaner
This class contains map with special entities used in HTML and their unicodes.
SpecialEntity(String, int) - Constructor for class org.htmlcleaner.SpecialEntity
 
src - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
start() - Method in class org.htmlcleaner.HtmlTokenizer
Starts parsing HTML.
startElement(String, String, String, Attributes) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
startsWith(String) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if content starts with specified value at the current position.
startsWithSimple(String) - Method in class org.htmlcleaner.HtmlTokenizer
 

T

TagAllCondition() - Constructor for class org.htmlcleaner.TagNode.TagAllCondition
 
tagAttributes() - Method in class org.htmlcleaner.HtmlTokenizer
Parses list tag attributes from the current position.
tagEncountered(String) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
 
tagEnd() - Method in class org.htmlcleaner.HtmlTokenizer
Parses end of the tag.
tagExists(String) - Method in class org.htmlcleaner.HtmlCleaner.OpenTags
 
tagInfo - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
TagInfo - Class in org.htmlcleaner
Class contains information about single HTML tag.
It also contains rules for tag balancing.
TagInfo(String, int, int, boolean, boolean, boolean) - Constructor for class org.htmlcleaner.TagInfo
 
taginfofile - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
tagInfoMap - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
tagInfoProvider - Variable in class org.htmlcleaner.CleanerProperties
 
tagInfoProvider - Variable in class org.htmlcleaner.HtmlCleaner
 
tagInfoProvider - Variable in class org.htmlcleaner.HtmlTokenizer
 
TagNode - Class in org.htmlcleaner
XML node tag - basic node of the cleaned HTML tree.
TagNode(String) - Constructor for class org.htmlcleaner.TagNode
 
TagNode.ITagNodeCondition - Interface in org.htmlcleaner
Used as base for different node checkers.
TagNode.TagAllCondition - Class in org.htmlcleaner
All nodes.
TagNode.TagNodeAttExistsCondition - Class in org.htmlcleaner
Checks if node contains specified attribute.
TagNode.TagNodeAttValueCondition - Class in org.htmlcleaner
Checks if node has specified attribute with specified value.
TagNode.TagNodeNameCondition - Class in org.htmlcleaner
Checks if node has specified name.
TagNodeAttExistsCondition(String) - Constructor for class org.htmlcleaner.TagNode.TagNodeAttExistsCondition
 
TagNodeAttValueCondition(String, String, boolean) - Constructor for class org.htmlcleaner.TagNode.TagNodeAttValueCondition
 
TagNodeNameCondition(String) - Constructor for class org.htmlcleaner.TagNode.TagNodeNameCondition
 
TagNodeVisitor - Interface in org.htmlcleaner
Defines action to be performed on TagNodes
TagPos(int, String) - Constructor for class org.htmlcleaner.HtmlCleaner.TagPos
 
tagStart() - Method in class org.htmlcleaner.HtmlTokenizer
Parses start of the tag.
TagToken - Class in org.htmlcleaner
HTML tag token - descendants are start (TagNode) and end token (EndTagToken).
TagToken() - Constructor for class org.htmlcleaner.TagToken
 
TagToken(String) - Constructor for class org.htmlcleaner.TagToken
 
TagTransformation - Class in org.htmlcleaner
Describes how specified tag is transformed to another one, or is ignored during parsing
TagTransformation(String) - Constructor for class org.htmlcleaner.TagTransformation
Creates new tag transformation in which specified tag will be skipped (ignored) during parsing process.
TagTransformation(String, String) - Constructor for class org.htmlcleaner.TagTransformation
Creates new tag transformation from source tag to target tag preserving all source tag attributes.
TagTransformation(String, String, boolean) - Constructor for class org.htmlcleaner.TagTransformation
Creates new tag transformation from source tag to target tag specifying whether source tag attributes are preserved.
text - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
throwStandardException() - Method in class org.htmlcleaner.XPather
 
toBoolean(String) - Static method in class org.htmlcleaner.CommandLine
 
tokenArray - Variable in class org.htmlcleaner.XPather
 
tokenize(String, String) - Static method in class org.htmlcleaner.Utils
 
toString() - Method in class org.htmlcleaner.CommentNode
 
toString() - Method in class org.htmlcleaner.ContentNode
 
toString() - Method in class org.htmlcleaner.DoctypeToken
 
toString() - Method in class org.htmlcleaner.TagToken
 
toText(Object) - Method in class org.htmlcleaner.XPather
 
transform - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
transformations - Variable in class org.htmlcleaner.HtmlCleaner
 
transformations - Variable in class org.htmlcleaner.HtmlTokenizer
 
transformAttributes(TagTransformation) - Method in class org.htmlcleaner.TagNode
 
translateSpecialEntities - Variable in class org.htmlcleaner.CleanerProperties
 
transrescharstoncr - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
transResCharsToNCR - Variable in class org.htmlcleaner.CleanerProperties
 
transspecialentitiestoncr - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
transSpecialEntitiesToNCR - Variable in class org.htmlcleaner.CleanerProperties
 
traverse(TagNodeVisitor) - Method in class org.htmlcleaner.TagNode
Traverses the tree and performs visitor's action on each node.
traverseInternally(TagNodeVisitor) - Method in class org.htmlcleaner.TagNode
 
treatDeprecatedTagsAsContent - Variable in class org.htmlcleaner.CleanerProperties
 
treatdeprtagsascontent - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
treatunknowntagsascontent - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
treatUnknownTagsAsContent - Variable in class org.htmlcleaner.CleanerProperties
 

U

unicodechars - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
unique - Variable in class org.htmlcleaner.TagInfo
 
updateTagTransformations(CleanerTransformations, String, String) - Static method in class org.htmlcleaner.Utils
 
usecdata - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
useCdataForScriptAndStyle - Variable in class org.htmlcleaner.CleanerProperties
 
useemptyelementtags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
useEmptyElementTags - Variable in class org.htmlcleaner.CleanerProperties
 
Utils - Class in org.htmlcleaner
Common utilities.
Utils() - Constructor for class org.htmlcleaner.Utils
 

V

VAR_END - Static variable in class org.htmlcleaner.Utils
 
VAR_START - Static variable in class org.htmlcleaner.Utils
 
visit(TagNode, HtmlNode) - Method in interface org.htmlcleaner.TagNodeVisitor
Action to be performed on single node in the tree

W

WORKING_BUFFER_SIZE - Static variable in class org.htmlcleaner.HtmlTokenizer
 
write(TagNode, Writer, String) - Method in class org.htmlcleaner.Serializer
Writes specified node using specified writer.
write(TagNode, Writer, String, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified node using specified writer.
writeToFile(TagNode, String) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the file, using system default charset.
writeToFile(TagNode, String, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the file, using specified charset and optionally omits node envelope (skips open and close tags of the node).
writeToFile(TagNode, String, String) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the file, using specified charset.
writeToFile(TagNode, String, String, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the file, using specified charset and optionally omits node envelope (skips open and close tags of the node).
writeToStream(TagNode, OutputStream) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the output stream, using system default charset.
writeToStream(TagNode, OutputStream, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the output stream, using system default charset and optionally omits node envelope (skips open and close tags of the node).
writeToStream(TagNode, OutputStream, String) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the output stream, using specified charset.
writeToStream(TagNode, OutputStream, String, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the output stream, using specified charset and optionally omits node envelope (skips open and close tags of the node).
writeXml(TagNode, Writer, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use write() instead.
writeXmlToFile(TagNode, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use writeToFile() instead.
writeXmlToFile(TagNode, String, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use writeToFile() instead.
writeXmlToStream(TagNode, OutputStream) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use writeToStream() instead.
writeXmlToStream(TagNode, OutputStream, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use writeToStream() instead.

X

XmlSerializer - Class in org.htmlcleaner
Abstract XML serializer - contains common logic for descendants.
XmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.XmlSerializer
 
XPather - Class in org.htmlcleaner
Utility for searching cleaned document tree with XPath expressions.
XPather(String) - Constructor for class org.htmlcleaner.XPather
Constructor - creates XPather instance with specified XPath expression.
XPatherException - Exception in org.htmlcleaner
Exception that could occure during XPather evaluation.
XPatherException() - Constructor for exception org.htmlcleaner.XPatherException
 
XPatherException(String) - Constructor for exception org.htmlcleaner.XPatherException
 
XPatherException(String, Throwable) - Constructor for exception org.htmlcleaner.XPatherException
 
XPatherException(Throwable) - Constructor for exception org.htmlcleaner.XPatherException
 

_

_asExpected - Variable in class org.htmlcleaner.HtmlTokenizer
 
_bodyOpened - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
_currentTagToken - Variable in class org.htmlcleaner.HtmlTokenizer
 
_docType - Variable in class org.htmlcleaner.HtmlTokenizer
 
_headOpened - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
_headTags - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
_instance - Static variable in class org.htmlcleaner.DefaultTagProvider
 
_isScriptContext - Variable in class org.htmlcleaner.HtmlTokenizer
 
_len - Variable in class org.htmlcleaner.HtmlTokenizer
 
_openTags - Variable in class org.htmlcleaner.HtmlCleaner.CleanTimeValues
 
_pos - Variable in class org.htmlcleaner.HtmlTokenizer
 
_reader - Variable in class org.htmlcleaner.HtmlTokenizer
 
_saved - Variable in class org.htmlcleaner.HtmlTokenizer
 
_savedLen - Variable in class org.htmlcleaner.HtmlTokenizer
 
_tokenList - Variable in class org.htmlcleaner.HtmlTokenizer
 
_working - Variable in class org.htmlcleaner.HtmlTokenizer
 
A B C D E F G H I J K L M N O P R S T U V W X _ 
All Classes All Packages