HTML Lexer Jar Home Page

Serialized Form


Package org.htmlparser

Class org.htmlparser.Attribute extends java.lang.Object implements Serializable

Serialized Fields

mName

java.lang.String mName
The name of this attribute. The part before the equals sign, or the stand-alone attribute. This will be null if the attribute is whitespace.


mAssignment

java.lang.String mAssignment
The assignment string of the attribute. The equals sign. This will be null if the attribute is a stand-alone attribute.


mValue

java.lang.String mValue
The value of the attribute. The part after the equals sign. This will be null if the attribute is an empty or stand-alone attribute.


mQuote

char mQuote
The quote character, if any, surrounding the value of the attribute. This will be zero if there are no quotes around the value.
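
As an illustration of how these four fields combine, here is a minimal sketch. It is not the library's implementation: the class name AttributeSketch and the method render() are hypothetical, and only the field names and their null/zero conventions are taken from the descriptions above.

```java
// Hypothetical stand-in for illustration; not org.htmlparser.Attribute itself.
class AttributeSketch {
    final String mName;       // null if the attribute is whitespace
    final String mAssignment; // null for a stand-alone attribute
    final String mValue;      // null for an empty or stand-alone attribute
    final char mQuote;        // 0 when no quotes surround the value

    AttributeSketch(String name, String assignment, String value, char quote) {
        mName = name;
        mAssignment = assignment;
        mValue = value;
        mQuote = quote;
    }

    // Concatenate whichever parts are present, in source order.
    String render() {
        StringBuilder sb = new StringBuilder();
        if (mName != null) sb.append(mName);
        if (mAssignment != null) sb.append(mAssignment);
        if (mValue != null) {
            if (mQuote != 0) sb.append(mQuote);
            sb.append(mValue);
            if (mQuote != 0) sb.append(mQuote);
        }
        return sb.toString();
    }
}
```

Under these assumptions, a stand-alone attribute such as checked carries only a name, an empty attribute such as alt= carries a name and an assignment, and a quoted attribute carries all four parts.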


Package org.htmlparser.http

Class org.htmlparser.http.Cookie extends java.lang.Object implements Serializable

Serialized Fields

mName

java.lang.String mName
The name of the cookie.


mValue

java.lang.String mValue
The cookie value.


mComment

java.lang.String mComment
Describes the cookie's use.


mDomain

java.lang.String mDomain
The domain that sees the cookie.


mExpiry

java.util.Date mExpiry
Cookie expires after this date.


mPath

java.lang.String mPath
URLs that see the cookie.


mSecure

boolean mSecure
Whether the cookie should be sent only over a secure (SSL) connection.


mVersion

int mVersion
The cookie version. A value of 1 means RFC 2109++ style cookies.


Package org.htmlparser.lexer

Class org.htmlparser.lexer.Cursor extends java.lang.Object implements Serializable

Serialized Fields

mPosition

int mPosition
This cursor's position.


mPage

Page mPage
This cursor's page.

Class org.htmlparser.lexer.InputStreamSource extends Source implements Serializable

Serialization Methods

readObject

private void readObject(java.io.ObjectInputStream in)
                 throws java.io.IOException,
                        java.lang.ClassNotFoundException
Deserialization support.

Throws:
java.io.IOException - If deserialization has a problem.
java.lang.ClassNotFoundException

writeObject

private void writeObject(java.io.ObjectOutputStream out)
                  throws java.io.IOException
Serialization support.

Throws:
java.io.IOException - If serialization has a problem.

Serialized Fields

mEncoding

java.lang.String mEncoding
The character set in use.


mBuffer

char[] mBuffer
The characters read so far.


mLevel

int mLevel
The number of valid characters in the buffer.


mOffset

int mOffset
The offset of the next character returned by read().


mMark

int mMark
The bookmark.
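
To show how a buffer, a fill level, a read offset and a bookmark can work together, here is a minimal sketch. It is not the InputStreamSource implementation: the class name BufferedSourceSketch, the initial buffer size, the doubling growth policy and the behaviour of reset() without a prior mark() are all assumptions made for illustration.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.util.Arrays;

// Hypothetical stand-in; keeps every decoded character so reset() can rewind.
class BufferedSourceSketch {
    private final Reader mReader;
    private char[] mBuffer = new char[16]; // characters read so far
    private int mLevel;                    // number of valid entries in mBuffer
    private int mOffset;                   // index of the next character to return
    private int mMark = -1;                // bookmark, -1 when unset

    BufferedSourceSketch(InputStream in, String encoding) {
        mReader = new InputStreamReader(in, Charset.forName(encoding));
    }

    // Return the next character, or -1 at end of input.
    int read() {
        if (mOffset == mLevel && !fill()) return -1;
        return mBuffer[mOffset++];
    }

    // Decode more characters from the stream, growing the buffer if it is full.
    private boolean fill() {
        try {
            if (mLevel == mBuffer.length) {
                mBuffer = Arrays.copyOf(mBuffer, mBuffer.length * 2);
            }
            int n = mReader.read(mBuffer, mLevel, mBuffer.length - mLevel);
            if (n <= 0) return false;
            mLevel += n;
            return true;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    void mark() { mMark = mOffset; }

    // Rewind to the bookmark, or to the start if no bookmark was set (assumption).
    void reset() { mOffset = (mMark < 0) ? 0 : mMark; }
}
```

Because the buffer holds characters that have already been decoded, rewinding never needs to re-read or re-decode the underlying byte stream.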

Class org.htmlparser.lexer.Lexer extends java.lang.Object implements Serializable

Serialized Fields

mPage

Page mPage
The page lexemes are retrieved from.


mCursor

Cursor mCursor
The current position on the page.


mFactory

NodeFactory mFactory
The factory for new nodes.

Class org.htmlparser.lexer.Page extends java.lang.Object implements Serializable

Serialization Methods

readObject

private void readObject(java.io.ObjectInputStream in)
                 throws java.io.IOException,
                        java.lang.ClassNotFoundException
Deserialize the page. For details see writeObject().

Throws:
java.io.IOException - If there is a deserialization problem with the stream.
java.lang.ClassNotFoundException - If the deserialized class can't be located with the current classpath and class loader.

writeObject

private void writeObject(java.io.ObjectOutputStream out)
                  throws java.io.IOException
Serialize the page. There are two modes of serializing a page, depending on its connected state. If connected, the URL and the current offset are saved; if disconnected, the underlying source is saved.

Throws:
java.io.IOException - If there is a serialization problem.

Serialized Fields

mUrl

java.lang.String mUrl
The URL this page is coming from. Cached value of getConnection().toExternalForm() or setUrl().


mBaseUrl

java.lang.String mBaseUrl
The base URL for this page.


mSource

Source mSource
The source of characters.


mIndex

PageIndex mIndex
Character positions of the first character in each line.
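
The two serialization modes described for Page.writeObject() can be sketched with the standard private writeObject/readObject hooks. This is a minimal illustration and not the Page implementation: the class PageSketch, its factory methods, describe() and roundTrip() are hypothetical, and a String stands in for the underlying source. A real page would presumably reopen the connection and skip to the saved offset on deserialization, which the sketch does not attempt.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;

// Hypothetical stand-in; writes different data depending on the connected state.
class PageSketch implements Serializable {
    private static final long serialVersionUID = 1L;
    private transient String url;    // non-null means "connected"
    private transient int offset;    // current position when connected
    private transient String source; // characters held when disconnected

    static PageSketch connected(String url, int offset) {
        PageSketch p = new PageSketch();
        p.url = url;
        p.offset = offset;
        return p;
    }

    static PageSketch disconnected(String source) {
        PageSketch p = new PageSketch();
        p.source = source;
        return p;
    }

    private void writeObject(ObjectOutputStream out) throws IOException {
        out.defaultWriteObject();
        boolean isConnected = url != null;
        out.writeBoolean(isConnected);
        if (isConnected) {
            out.writeObject(url);   // connected: save only URL and offset
            out.writeInt(offset);
        } else {
            out.writeObject(source); // disconnected: save the source itself
        }
    }

    private void readObject(ObjectInputStream in)
            throws IOException, ClassNotFoundException {
        in.defaultReadObject();
        if (in.readBoolean()) {
            url = (String) in.readObject();
            offset = in.readInt();
        } else {
            source = (String) in.readObject();
        }
    }

    String describe() {
        return url != null ? "connected:" + url + "@" + offset
                           : "disconnected:" + source;
    }

    // Serialize to a byte array and read the object back.
    static PageSketch roundTrip(PageSketch page) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(page);
            }
            try (ObjectInputStream in = new ObjectInputStream(
                    new ByteArrayInputStream(bytes.toByteArray()))) {
                return (PageSketch) in.readObject();
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }
}
```

Saving only the URL and offset keeps the serialized form small for a connected page, at the cost of needing the resource to be reachable again when the page is restored.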

Class org.htmlparser.lexer.PageAttribute extends Attribute implements Serializable

Serialized Fields

mPage

Page mPage
The page this attribute is extracted from.


mNameStart

int mNameStart
The starting offset of the name within the page. If negative, the name is considered null.


mNameEnd

int mNameEnd
The ending offset of the name within the page.


mValueStart

int mValueStart
The starting offset of the value within the page. If negative, the value is considered null.


mValueEnd

int mValueEnd
The ending offset of the value within the page.
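
Storing offsets rather than strings lets the name and value be extracted from the page text on demand. The following is a minimal sketch of that idea, not the PageAttribute implementation: the class PageAttributeSketch is hypothetical, a plain String stands in for the Page, and the end offsets are assumed to be exclusive.

```java
// Hypothetical stand-in; a plain String plays the role of the Page.
class PageAttributeSketch {
    private final String mPageText;
    private final int mNameStart;
    private final int mNameEnd;    // assumed exclusive
    private final int mValueStart;
    private final int mValueEnd;   // assumed exclusive

    PageAttributeSketch(String pageText, int nameStart, int nameEnd,
                        int valueStart, int valueEnd) {
        mPageText = pageText;
        mNameStart = nameStart;
        mNameEnd = nameEnd;
        mValueStart = valueStart;
        mValueEnd = valueEnd;
    }

    // A negative start offset means the name is considered null.
    String getName() {
        return mNameStart < 0 ? null : mPageText.substring(mNameStart, mNameEnd);
    }

    // A negative start offset means the value is considered null.
    String getValue() {
        return mValueStart < 0 ? null : mPageText.substring(mValueStart, mValueEnd);
    }
}
```

For the page text <a href="x.html">, offsets 3 to 7 select the name href and offsets 9 to 15 select the value x.html.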

Class org.htmlparser.lexer.PageIndex extends java.lang.Object implements Serializable

Serialized Fields

mIncrement

int mIncrement
Increment for allocations.


mCount

int mCount
The number of valid elements.


mIndices

int[] mIndices
The elements.


mPage

Page mPage
The page associated with this index.
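
An index of line-start positions makes it cheap to turn a character offset into a row and column. The sketch below illustrates one way to do this with a sorted int array that grows by a fixed increment. It is not the PageIndex implementation: the class LineIndexSketch and its methods add(), row() and column() are hypothetical, and it assumes line starts are added in ascending order beginning with 0.

```java
import java.util.Arrays;

// Hypothetical stand-in; a sorted array of line-start offsets.
class LineIndexSketch {
    private final int mIncrement; // how many slots to add when the array is full
    private int mCount;           // number of valid elements
    private int[] mIndices;       // the elements, in ascending order

    LineIndexSketch(int increment) {
        mIncrement = increment;
        mIndices = new int[increment];
    }

    // Record the offset of the first character of the next line.
    void add(int lineStart) {
        if (mCount == mIndices.length) {
            mIndices = Arrays.copyOf(mIndices, mIndices.length + mIncrement);
        }
        mIndices[mCount++] = lineStart;
    }

    // Zero-based row containing the given character position.
    int row(int position) {
        int found = Arrays.binarySearch(mIndices, 0, mCount, position);
        // A miss returns -(insertionPoint) - 1; the row is insertionPoint - 1.
        return found >= 0 ? found : -found - 2;
    }

    // Zero-based column of the position within its row.
    int column(int position) {
        return position - mIndices[row(position)];
    }
}
```

For the text "hello\nworld!\nbye" the line starts are 0, 6 and 13, so position 8 falls on row 1 at column 2.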

Class org.htmlparser.lexer.Source extends java.io.Reader implements Serializable

Class org.htmlparser.lexer.StringSource extends Source implements Serializable

Serialized Fields

mString

java.lang.String mString
The source of characters.


mOffset

int mOffset
The current offset into the string.


mEncoding

java.lang.String mEncoding
The encoding to report. Only used by StringSource.getEncoding().


mMark

int mMark
The bookmark.
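
A string-backed character source needs little more than an offset and a bookmark. The following minimal sketch shows the idea using java.io.Reader directly. It is not the StringSource implementation: the class StringSourceSketch is hypothetical, and the encoding field is left out because it plays no part in reading.

```java
import java.io.Reader;

// Hypothetical stand-in; reads characters from a String with mark/reset support.
class StringSourceSketch extends Reader {
    private final String mString; // the source of characters
    private int mOffset;          // the current offset into the string
    private int mMark;            // the bookmark

    StringSourceSketch(String string) {
        mString = string;
    }

    @Override
    public int read() {
        return mOffset < mString.length() ? mString.charAt(mOffset++) : -1;
    }

    @Override
    public int read(char[] cbuf, int off, int len) {
        if (mOffset >= mString.length()) return -1;
        int n = Math.min(len, mString.length() - mOffset);
        mString.getChars(mOffset, mOffset + n, cbuf, off);
        mOffset += n;
        return n;
    }

    @Override
    public boolean markSupported() {
        return true;
    }

    // The read-ahead limit is ignored because the whole string stays in memory.
    @Override
    public void mark(int readAheadLimit) {
        mMark = mOffset;
    }

    @Override
    public void reset() {
        mOffset = mMark;
    }

    @Override
    public void close() {
        // Nothing to release in this sketch.
    }
}
```

Since the bookmark starts at zero, calling reset() before any mark() simply rewinds to the beginning of the string.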


Package org.htmlparser.nodes

Class org.htmlparser.nodes.AbstractNode extends java.lang.Object implements Serializable

Serialized Fields

mPage

Page mPage
The page this node came from.


nodeBegin

int nodeBegin
The beginning position of the tag in the line.


nodeEnd

int nodeEnd
The ending position of the tag in the line.


parent

Node parent
The parent of this node.


children

NodeList children
The children of this node.

Class org.htmlparser.nodes.RemarkNode extends AbstractNode implements Serializable

Serialized Fields

mText

java.lang.String mText
The contents of the remark node, or override text.

Class org.htmlparser.nodes.TagNode extends AbstractNode implements Serializable

Serialized Fields

mScanner

Scanner mScanner
The scanner for this tag.


mAttributes

java.util.Vector<E> mAttributes
The tag attributes. Objects of type Attribute. The first element is the tag name, subsequent elements being either whitespace or real attributes.
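
The layout of mAttributes, with the tag name first and whitespace kept as separate elements, means the original tag text can be rebuilt by simple concatenation. Here is a minimal sketch of that idea, not the TagNode implementation: the classes TagSketch and Part and all of their methods are hypothetical, and each element is assumed to carry its literal source text.

```java
import java.util.Vector;

// Hypothetical stand-in; models a tag as an ordered list of parts.
class TagSketch {
    static final class Part {
        final String name; // null when this part is only whitespace
        final String text; // literal source text of this part

        Part(String name, String text) {
            this.name = name;
            this.text = text;
        }
    }

    private final Vector<Part> mAttributes = new Vector<>();

    TagSketch add(String name, String text) {
        mAttributes.add(new Part(name, text));
        return this;
    }

    // The first element is the tag name.
    String getTagName() {
        return mAttributes.isEmpty() ? null : mAttributes.get(0).name;
    }

    // Count the elements after the tag name that are not whitespace.
    int countRealAttributes() {
        int n = 0;
        for (int i = 1; i < mAttributes.size(); i++) {
            if (mAttributes.get(i).name != null) n++;
        }
        return n;
    }

    // Rebuild the tag text by concatenating every part in order.
    String toHtml() {
        StringBuilder sb = new StringBuilder("<");
        for (Part p : mAttributes) sb.append(p.text);
        return sb.append('>').toString();
    }
}
```

Keeping whitespace as its own element is what allows the rebuilt text to match the original spacing exactly.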

Class org.htmlparser.nodes.TextNode extends AbstractNode implements Serializable

Serialized Fields

mText

java.lang.String mText
The contents of the string node, or override text.


Package org.htmlparser.scanners

Class org.htmlparser.scanners.TagScanner extends java.lang.Object implements Serializable


Package org.htmlparser.util

Class org.htmlparser.util.ChainedException extends java.lang.Exception implements Serializable

Serialized Fields

throwable

java.lang.Throwable throwable

Class org.htmlparser.util.EncodingChangeException extends ParserException implements Serializable

Serialized Fields

mOldEncoding

java.lang.String mOldEncoding
The old encoding.


mNewEncoding

java.lang.String mNewEncoding
The new encoding.

Class org.htmlparser.util.NodeList extends java.lang.Object implements Serializable

Serialized Fields

nodeData

Node[] nodeData

size

int size

capacity

int capacity

capacityIncrement

int capacityIncrement

Class org.htmlparser.util.ParserException extends ChainedException implements Serializable


© 2006 Derrick Oswald
April 1, 2006

HTML Parser is an open source library released under the Common Public License.