| Package | Description |
|---|---|
| org.apache.tika.fork | Forked parser. |
| org.apache.tika.parser | Tika parsers. |
| org.apache.tika.parser.external | External parser process. |

| Modifier and Type | Class and Description |
|---|---|
| class | `ForkParser` |

| Modifier and Type | Class and Description |
|---|---|
| class | `AbstractEncodingDetectorParser` Abstract base class for parsers that use the AutoDetectReader and need to use the EncodingDetector configured by TikaConfig. |
| class | `AbstractExternalProcessParser` Abstract base class for parsers that call external processes. |
| class | `AutoDetectParser` |
| class | `CompositeParser` Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document. |
| class | `CryptoParser` Decrypts the incoming document stream and delegates further parsing to another parser instance. |
| class | `DefaultParser` A composite parser based on all the Parser implementations available through the service provider mechanism. |
| class | `DelegatingParser` Base class for parser implementations that want to delegate parts of the task of parsing an input document to another parser. |
| class | `DigestingParser` |
| class | `EmptyParser` Dummy parser that always produces an empty XHTML document without even attempting to parse the given document stream. |
| class | `ErrorParser` Dummy parser that always throws a TikaException without even attempting to parse the given document stream. |
| class | `NetworkParser` |
| class | `ParserDecorator` Decorator base class for the Parser interface. |
| class | `ParserPostProcessor` Parser decorator that post-processes the results from a decorated parser. |
| class | `RecursiveParserWrapper` This is a helper class that wraps a parser in a recursive handler. |

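The CompositeParser entry describes delegation to a component parser by declared content type, and EmptyParser and ErrorParser describe two trivial endpoints for that delegation. The following is a minimal standalone sketch of that dispatch idea using only the JDK. The names `MiniParser`, `MiniCompositeParser`, and `CompositeDemo`, the two-argument `parse` signature, and the `text/plain` registration are hypothetical stand-ins for illustration; they are not Tika's classes or method signatures.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a parser: maps raw content to extracted text.
interface MiniParser {
    String parse(String contentType, String content);
}

// Hypothetical composite: chooses a component parser by declared content type.
final class MiniCompositeParser implements MiniParser {
    private final Map<String, MiniParser> parsers = new HashMap<>();
    private final MiniParser fallback;

    MiniCompositeParser(MiniParser fallback) {
        this.fallback = fallback;
    }

    // Registers a component parser for one content type; returns this for chaining.
    MiniCompositeParser register(String contentType, MiniParser parser) {
        parsers.put(contentType, parser);
        return this;
    }

    @Override
    public String parse(String contentType, String content) {
        // Delegate to the registered parser, or to the fallback if none matches.
        MiniParser chosen = parsers.getOrDefault(contentType, fallback);
        return chosen.parse(contentType, content);
    }
}

final class CompositeDemo {
    // Fallback in the spirit of an "empty" parser: never inspects the content.
    static final MiniParser EMPTY = (type, content) -> "";

    // Fallback in the spirit of an "error" parser: always fails.
    static final MiniParser ERROR = (type, content) -> {
        throw new IllegalStateException("No parser registered for " + type);
    };

    // Builds a composite with a single plain-text component parser.
    static MiniCompositeParser build(MiniParser fallback) {
        return new MiniCompositeParser(fallback)
                .register("text/plain", (type, content) -> content.trim());
    }

    public static void main(String[] args) {
        MiniCompositeParser lenient = build(EMPTY);
        System.out.println(lenient.parse("text/plain", "  hello  "));                 // prints "hello"
        System.out.println(lenient.parse("application/x-unknown", "data").isEmpty()); // prints "true"
    }
}
```

Swapping the fallback from `EMPTY` to `ERROR` changes how unregistered types are handled, from silently returning nothing to failing loudly, without touching the dispatch logic.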
| Modifier and Type | Class and Description |
|---|---|
| class | `CompositeExternalParser` A Composite Parser that wraps up all the available External Parsers, and provides an easy way to access them. |
| class | `ExternalParser` Parser that uses an external program (like catdoc or pdf2txt) to extract text content and metadata from a given document. |

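The ExternalParser entry describes handing a document to an external program and collecting the text it produces. The sketch below shows one way that general approach can look with the JDK's `ProcessBuilder`. It is not Tika's implementation: the class `ExternalTextExtractor`, its `extract` method, and the temp-file handoff are assumptions made for illustration, and the demo uses `cat` as a placeholder command, which assumes a Unix-like system with `cat` on the `PATH`.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: runs an external command on a document and returns its stdout.
final class ExternalTextExtractor {

    // Writes the document to a temp file, appends that path to the command,
    // runs the command, and returns whatever it printed to standard output.
    static String extract(List<String> command, byte[] document) {
        Path input = null;
        try {
            input = Files.createTempFile("extract-", ".bin");
            Files.write(input, document);

            List<String> full = new ArrayList<>(command);
            full.add(input.toString());

            Process process = new ProcessBuilder(full)
                    .redirectError(ProcessBuilder.Redirect.DISCARD) // ignore stderr
                    .start();
            process.getOutputStream().close(); // the child receives no stdin

            String text;
            try (InputStream out = process.getInputStream()) {
                text = new String(out.readAllBytes(), StandardCharsets.UTF_8);
            }

            int exit = process.waitFor();
            if (exit != 0) {
                throw new IllegalStateException("Command exited with status " + exit);
            }
            return text;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while waiting for command", e);
        } finally {
            if (input != null) {
                input.toFile().delete(); // best-effort cleanup of the temp file
            }
        }
    }

    public static void main(String[] args) {
        byte[] document = "hello".getBytes(StandardCharsets.UTF_8);
        // "cat" is only a placeholder for a real text extraction program.
        System.out.println(extract(List.of("cat"), document));
    }
}
```

Reading stdout to the end before calling `waitFor` keeps the child from blocking on a full pipe; a production wrapper would likely also want a timeout and a size limit on the captured output.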
Copyright © 2007–1969 The Apache Software Foundation. All rights reserved.