public class Icu4jEncodingDetector extends Object implements org.apache.tika.detect.EncodingDetector
| Constructor and Description |
|---|
| Icu4jEncodingDetector() |
| Modifier and Type | Method and Description |
|---|---|
| Charset | detect(InputStream input, org.apache.tika.metadata.Metadata metadata) |
| int | getMarkLimit() |
| boolean | getStripMarkup() |
| void | setMarkLimit(int markLimit): How far into the stream to read for charset detection. |
| void | setStripMarkup(boolean stripMarkup): Whether or not to attempt to strip html-ish markup from the stream before sending it to the underlying detector. |
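To make the idea of stripping html-ish markup before detection concrete, here is a minimal sketch that drops everything between `<` and `>` from a byte buffer. It uses only the JDK; the class and method names (`StripMarkupSketch`, `stripTags`) are hypothetical, and this is an illustration of the general technique, not the code this class or the underlying detector actually runs.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

public class StripMarkupSketch {

    /**
     * Naive tag stripper (hypothetical helper, not part of Tika).
     * Assumes an ASCII-compatible encoding, so that '<' and '>' are
     * single bytes; it would not work on UTF-16 input.
     */
    public static byte[] stripTags(byte[] input) {
        ByteArrayOutputStream out = new ByteArrayOutputStream(input.length);
        boolean inTag = false;
        for (byte b : input) {
            if (b == '<') {
                inTag = true;          // start skipping at the opening bracket
            } else if (b == '>' && inTag) {
                inTag = false;         // stop skipping after the closing bracket
            } else if (!inTag) {
                out.write(b);          // keep bytes that are outside any tag
            }
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        byte[] html = "<p>abc</p> <b>x</b>".getBytes(StandardCharsets.US_ASCII);
        System.out.println(new String(stripTags(html), StandardCharsets.US_ASCII));
    }
}
```

Removing tags this way leaves more of the sample to the document's actual text, which is presumably why a detector might want the option; whether that helps on a given corpus is something to measure rather than assume.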
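public Charset detect(InputStream input, org.apache.tika.metadata.Metadata metadata) throws IOException
Specified by: detect in interface org.apache.tika.detect.EncodingDetector
Throws: IOException

@Field public void setStripMarkup(boolean stripMarkup)
Whether or not to attempt to strip html-ish markup from the stream before sending it to the underlying detector.
Parameters: stripMarkup - whether or not to attempt to strip markup before sending the stream to the underlying detector

public boolean getStripMarkup()

@Field public void setMarkLimit(int markLimit)
How far into the stream to read for charset detection.
Parameters: markLimit - how far into the stream to read for charset detection

public int getMarkLimit()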
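As an illustration of what a mark limit means for a stream-based detector, the sketch below marks the stream, reads at most `markLimit` bytes, and resets the stream so the caller can read it again from the start. It relies only on `java.io`; the names `MarkLimitSketch` and `peek` are hypothetical, and this is a sketch of the mark/reset pattern in general, not the implementation behind `detect`.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class MarkLimitSketch {

    /**
     * Reads up to markLimit bytes and then rewinds the stream
     * (hypothetical helper, not part of Tika).
     */
    public static byte[] peek(InputStream input, int markLimit) throws IOException {
        if (!input.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        input.mark(markLimit);                 // remember the current position
        try {
            byte[] buffer = new byte[markLimit];
            int total = 0;
            while (total < markLimit) {
                int n = input.read(buffer, total, markLimit - total);
                if (n == -1) {
                    break;                     // stream ended before the limit
                }
                total += n;
            }
            return Arrays.copyOf(buffer, total);
        } finally {
            input.reset();                     // rewind so the caller sees every byte
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] data = "hello, world".getBytes(StandardCharsets.UTF_8);
        // ByteArrayInputStream already supports mark/reset; the buffered
        // wrapper shows what a caller would do for streams that do not.
        InputStream in = new BufferedInputStream(new ByteArrayInputStream(data));
        byte[] head = peek(in, 5);
        System.out.println(new String(head, StandardCharsets.UTF_8));
        System.out.println((char) in.read());
    }
}
```

A larger limit gives a detector more bytes to sample at the cost of more buffering; a stream that does not support mark/reset would need to be wrapped (for example in a `BufferedInputStream`) before a helper like this could rewind it.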
Copyright © 2007–2018 The Apache Software Foundation. All rights reserved.