|
Class Summary |
| AbstractText |
Abstarct text is the base class for all Text extraction data. |
| GlyphText |
Glyph Sprite contains glyph bound and textual information for drawing
and text extraction. |
| LinePositionComparator |
The LinePositionComparator is optionally called by text extraction algorithms
to help insure text lines found on a page are ordered using the y coordinates
of the bounding box in the cartesian plane's fourth quadrant. |
| LineText |
Line text is make up WordText objects. |
| PageText |
Page text represents the root element of a page's text hierarchy which
looks something like this. |
| WordPositionComparator |
The WordPositionComparator is optionally called by text extraction algorithms
to help insure words found in a line are ordered using the x coordinates
of the bounding box in the cartesian plane's fourth quadrant. |
| WordText |
Word text represents an individual word in the document. |