public abstract class HtmlTokenizer extends Object
Its task is to parse HTML and produce a list of valid tokens: open tag tokens, end tag tokens, content (text), and comments. As soon as a new item is added to the token list, the cleaner is invoked to clean the current list at its end.
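The four token kinds named above can be illustrated with a small, dependency-free sketch. This is not HtmlCleaner's implementation: `ToyTokenizer`, `Kind`, `Token`, and `tokenize` are hypothetical names introduced only for this example, and the sketch skips everything the real class handles through `CleanerProperties`, `CleanerTransformations`, and `ITagInfoProvider` (attributes, doctype, cleaning of the token list).

```java
import java.util.ArrayList;
import java.util.List;

// Toy illustration only; this is NOT HtmlCleaner's HtmlTokenizer.
final class ToyTokenizer {

    // The four token kinds mentioned in the class description.
    enum Kind { OPEN_TAG, END_TAG, TEXT, COMMENT }

    static final class Token {
        final Kind kind;
        final String value;

        Token(Kind kind, String value) {
            this.kind = kind;
            this.value = value;
        }

        @Override
        public String toString() {
            return kind + "(" + value + ")";
        }
    }

    // Splits the input into tokens in a single left-to-right pass.
    static List<Token> tokenize(String html) {
        List<Token> tokens = new ArrayList<>();
        int pos = 0;
        while (pos < html.length()) {
            if (html.startsWith("<!--", pos)) {
                // Comment: everything up to the closing "-->".
                int end = html.indexOf("-->", pos + 4);
                if (end < 0) {
                    end = html.length(); // unterminated comment: take the rest
                }
                tokens.add(new Token(Kind.COMMENT, html.substring(pos + 4, end)));
                pos = Math.min(end + 3, html.length());
            } else if (html.charAt(pos) == '<' && html.indexOf('>', pos) > 0) {
                // Tag: "</name>" is an end tag, anything else an open tag.
                int end = html.indexOf('>', pos);
                boolean closing = html.startsWith("</", pos);
                String body = html.substring(pos + (closing ? 2 : 1), end).trim();
                tokens.add(new Token(closing ? Kind.END_TAG : Kind.OPEN_TAG, body));
                pos = end + 1;
            } else {
                // Text: runs until the next '<' or the end of the input.
                int next = html.indexOf('<', pos + 1);
                if (next < 0) {
                    next = html.length();
                }
                tokens.add(new Token(Kind.TEXT, html.substring(pos, next)));
                pos = next;
            }
        }
        return tokens;
    }
}
```

Calling `ToyTokenizer.tokenize("<p>Hi<!-- note --></p>")` yields four tokens in order: an open tag `p`, the text `Hi`, the comment ` note `, and an end tag `p`.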
Constructor and Description |
---|
HtmlTokenizer(Reader reader, CleanerProperties props, CleanerTransformations transformations, ITagInfoProvider tagInfoProvider) Constructor - creates an instance of the parser with the specified content. |
Modifier and Type | Method and Description |
---|---|
DoctypeToken | getDocType() |
public HtmlTokenizer(Reader reader, CleanerProperties props, CleanerTransformations transformations, ITagInfoProvider tagInfoProvider) throws IOException
Parameters:
- reader
- props
- transformations
- tagInfoProvider

Throws:
- IOException
public DoctypeToken getDocType()
Copyright © 2006–2017. All rights reserved.