public class HtmlTokenizer extends Object
Its taks is to parse HTML and produce list of valid tokens: open tag tokens, end tag tokens, contents (text) and comments. As soon as new item is added to token list, cleaner is invoked to clean current list at the end.
Created by: Vladimir Nikic.| Constructor and Description |
|---|
HtmlTokenizer(HtmlCleaner cleaner)
Constructor - cretes instance of the parser with specified content.
|
public HtmlTokenizer(HtmlCleaner cleaner) throws IOException
cleaner - IOExceptionCopyright © 2017 Java Wikipedia API (Bliki engine). All rights reserved.