public abstract class EntityBasedMicroformatExtractor extends MicroformatExtractor
Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor
BEGIN_SCRIPT, END_SCRIPT, valueFactory
Constructor and Description |
---|
EntityBasedMicroformatExtractor() |
Modifier and Type | Method and Description |
---|---|
boolean | extract(): Performs the extraction of the data and writes them to the model. |
protected abstract boolean | extractEntity(Node node, ExtractionResult out): Extracts an entity from a DOM node. |
protected abstract String | getBaseClassName(): Returns the base class name for the extractor. |
protected org.openrdf.model.BNode | getBlankNodeFor(Node node) |
protected abstract void | resetExtractor(): Resets the internal status of the extractor to prepare it for a new extraction section. |
addBNodeProperty, addBNodeProperty, addURIProperty, conditionallyAddLiteralProperty, conditionallyAddResourceProperty, conditionallyAddStringProperty, fixLink, fixLink, getCurrentExtractionResult, getDescription, getDocumentURI, getExtractionContext, getHTMLDocument, includes, openSubResult, run
protected abstract String getBaseClassName()
protected abstract void resetExtractor()
protected abstract boolean extractEntity(Node node, ExtractionResult out) throws ExtractionException
Parameters:
node - the DOM node.
out - the extraction result collector.
Returns:
true if the extraction has produced something, false otherwise.
Throws:
ExtractionException
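The return contract of extractEntity can be illustrated with a minimal, self-contained sketch. It does not use the real Any23 types: ResultCollector is a hypothetical stand-in for ExtractionResult, and reading a title attribute is an arbitrary choice made only for illustration. The point is that the method returns true only when it has actually written something to the collector.

```java
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;

// Hypothetical stand-in for ExtractionResult: it only records property/value pairs.
interface ResultCollector {
    void write(String property, String value);
}

class EntityContractSketch {

    // Returns true only if something was written to the collector, false otherwise.
    static boolean extractEntity(Node node, ResultCollector out) {
        if (!(node instanceof Element)) {
            return false;
        }
        // getAttribute returns an empty string when the attribute is absent.
        String title = ((Element) node).getAttribute("title");
        if (title.isEmpty()) {
            return false; // nothing produced
        }
        out.write("title", title);
        return true;
    }

    // Builds a detached <span> element, optionally carrying a title attribute.
    static Element element(String title) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
            Element span = doc.createElement("span");
            if (title != null) {
                span.setAttribute("title", title);
            }
            return span;
        } catch (ParserConfigurationException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        List<String> written = new ArrayList<>();
        ResultCollector out = (property, value) -> written.add(property + "=" + value);
        System.out.println(extractEntity(element("Example"), out)); // true
        System.out.println(extractEntity(element(null), out));      // false
        System.out.println(written);                                // [title=Example]
    }
}
```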
public boolean extract() throws ExtractionException
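Description copied from class: MicroformatExtractor
Performs the extraction of the data and writes them to the model.
Specified by:
extract in class MicroformatExtractor
Throws:
ExtractionException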
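How extract() might tie the abstract methods together can be sketched as a template method. This is an assumption about the overall flow, not the Any23 implementation: EntityExtractorSketch and VCardNameSketch are hypothetical classes, a plain List<String> stands in for ExtractionResult, and the "vcard" class name is chosen only as an example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical base class mirroring the documented method names; not the library code.
abstract class EntityExtractorSketch {
    protected abstract String getBaseClassName();
    protected abstract void resetExtractor();
    protected abstract boolean extractEntity(Node node, List<String> out);

    // Assumed flow: reset state, visit every element whose class attribute
    // contains the base class name, and report whether any entity produced output.
    public boolean extract(Document doc, List<String> out) {
        resetExtractor();
        boolean foundAny = false;
        NodeList all = doc.getElementsByTagName("*");
        for (int i = 0; i < all.getLength(); i++) {
            Element element = (Element) all.item(i);
            List<String> classes = Arrays.asList(element.getAttribute("class").trim().split("\\s+"));
            if (classes.contains(getBaseClassName())) {
                foundAny |= extractEntity(element, out);
            }
        }
        return foundAny;
    }
}

// Hypothetical subclass: collects the text of every element marked with class "vcard".
class VCardNameSketch extends EntityExtractorSketch {
    @Override
    protected String getBaseClassName() {
        return "vcard";
    }

    @Override
    protected void resetExtractor() {
        // No per-run state in this sketch.
    }

    @Override
    protected boolean extractEntity(Node node, List<String> out) {
        String text = node.getTextContent().trim();
        if (text.isEmpty()) {
            return false;
        }
        out.add(text);
        return true;
    }

    // Builds a small document: one matching element and one that should be skipped.
    static Document sampleDocument() {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
            Element body = doc.createElement("body");
            doc.appendChild(body);
            Element card = doc.createElement("div");
            card.setAttribute("class", "vcard primary");
            card.setTextContent("Jane Doe");
            body.appendChild(card);
            Element note = doc.createElement("div");
            note.setAttribute("class", "note");
            note.setTextContent("ignored");
            body.appendChild(note);
            return doc;
        } catch (ParserConfigurationException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        List<String> out = new ArrayList<>();
        boolean found = new VCardNameSketch().extract(sampleDocument(), out);
        System.out.println(found + " " + out);
    }
}
```

In this sketch a subclass supplies only the class name to look for and the per-node extraction logic, while the traversal stays in the base class.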
protected org.openrdf.model.BNode getBlankNodeFor(Node node)
Parameters:
node - a DOM node representing a blank node

Copyright © 2010-2013 The Apache Software Foundation. All Rights Reserved.