org.apache.poi
Class POIOLE2TextExtractor
java.lang.Object
org.apache.poi.POITextExtractor
org.apache.poi.POIOLE2TextExtractor
- Direct Known Subclasses:
- EventBasedExcelExtractor, ExcelExtractor, PowerPointExtractor, PublisherTextExtractor, VisioTextExtractor, WordExtractor
public abstract class POIOLE2TextExtractor
- extends POITextExtractor
Common Parent for OLE2 based Text Extractors
of POI Documents, such as .doc, .xls
You will typically find the implementation of
a given format's text extractor under
org.apache.poi.[format].extractor .
- See Also:
ExcelExtractor
,
PowerPointExtractor
,
VisioTextExtractor
,
WordExtractor
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
POIOLE2TextExtractor
public POIOLE2TextExtractor(POIDocument document)
- Creates a new text extractor for the given document
getDocSummaryInformation
public DocumentSummaryInformation getDocSummaryInformation()
- Returns the document information metadata for the document
getSummaryInformation
public SummaryInformation getSummaryInformation()
- Returns the summary information metadata for the document
getMetadataTextExtractor
public POITextExtractor getMetadataTextExtractor()
- Returns an HPSF powered text extractor for the
document properties metadata, such as title and author.
- Specified by:
getMetadataTextExtractor
in class POITextExtractor
Copyright 2008 The Apache Software Foundation or
its licensors, as applicable.