org.odftoolkit.odfdom.incubator.doc.text
org.odftoolkit.simple.common.TextExtractor
in Simple API.public class OdfTextExtractor extends DefaultElementVisitor
If you pass the content root as the parameter, the whole document content will be returned, without any tag information.
It implements part of white space handling fuctions: text:p, text:h, text:s, text:tab, text:linebreak are processed according to ODF specification.
Modifier and Type | Field and Description |
---|---|
protected StringBuilder |
mTextBuilder
Deprecated.
|
protected static char |
NewLineChar
Deprecated.
|
protected static char |
TabChar
Deprecated.
|
Modifier | Constructor and Description |
---|---|
protected |
OdfTextExtractor()
Deprecated.
Default constructor
|
protected |
OdfTextExtractor(OdfElement element)
Deprecated.
Constructor with an ODF element as paramter
|
Modifier and Type | Method and Description |
---|---|
protected void |
appendElementText(OdfElement ele)
Deprecated.
Append the text content of this element to string buffer.
|
String |
getText()
Deprecated.
Return the text content as a string
|
static OdfTextExtractor |
newOdfTextExtractor(OdfElement element)
Deprecated.
An instance of OdfTextExtractor will be created to
extract the text content of an ODF element.
|
void |
visit(OdfElement element)
Deprecated.
Visit odf element and process some operations
|
void |
visit(TextHElement ele)
Deprecated.
A method to visit "text:h" elements
|
void |
visit(TextLineBreakElement ele)
Deprecated.
A method to visit "text:line-break" elements
|
void |
visit(TextPElement ele)
Deprecated.
A method to visit "text:p" elements
|
void |
visit(TextSElement ele)
Deprecated.
A method to visit "text:s" elements
|
void |
visit(TextTabElement ele)
Deprecated.
A method to visit "text:tab" elements
|
visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit
protected StringBuilder mTextBuilder
protected static final char NewLineChar
protected static final char TabChar
protected OdfTextExtractor()
protected OdfTextExtractor(OdfElement element)
element
- the ODF element whose text would be extracted.protected void appendElementText(OdfElement ele)
ele
- the ODF element whose text will be appended.public static OdfTextExtractor newOdfTextExtractor(OdfElement element)
element
- the ODF element whose text will be extracted.public void visit(OdfElement element)
ElementVisitor
visit
in interface ElementVisitor
visit
in class DefaultElementVisitor
element
- ODF elementpublic void visit(TextPElement ele)
DefaultElementVisitor
visit
in class DefaultElementVisitor
ele
- - an instance of TextPElementpublic void visit(TextHElement ele)
DefaultElementVisitor
visit
in class DefaultElementVisitor
ele
- - an instance of TextHElementpublic void visit(TextSElement ele)
DefaultElementVisitor
visit
in class DefaultElementVisitor
ele
- - an instance of TextSElementpublic void visit(TextTabElement ele)
DefaultElementVisitor
visit
in class DefaultElementVisitor
ele
- - an instance of TextTabElementpublic void visit(TextLineBreakElement ele)
DefaultElementVisitor
visit
in class DefaultElementVisitor
ele
- - an instance of TextLineBreakElementpublic String getText()
Copyright © 2008-2014 The Apache Software Foundation. All Rights Reserved.