Package | Description |
---|---|
org.apache.nutch.hostdb | |
org.apache.nutch.indexer |
Index content, configure and run indexing and cleaning jobs to
add, update, and delete documents from an index.
|
org.apache.nutch.parse |
The
Parse interface and related classes. |
Modifier and Type | Field and Description |
---|---|
protected URLNormalizers |
UpdateHostDbMapper.normalizers |
Modifier and Type | Field and Description |
---|---|
protected URLNormalizers |
IndexingFiltersChecker.normalizers |
Modifier and Type | Field and Description |
---|---|
protected URLNormalizers |
ParserChecker.normalizers |
Modifier and Type | Method and Description |
---|---|
static java.lang.String |
ParseOutputFormat.filterNormalize(java.lang.String fromUrl,
java.lang.String toUrl,
java.lang.String fromHost,
boolean ignoreInternalLinks,
boolean ignoreExternalLinks,
java.lang.String ignoreExternalLinksMode,
URLFilters filters,
URLExemptionFilters exemptionFilters,
URLNormalizers normalizers) |
static java.lang.String |
ParseOutputFormat.filterNormalize(java.lang.String fromUrl,
java.lang.String toUrl,
java.lang.String origin,
boolean ignoreInternalLinks,
boolean ignoreExternalLinks,
java.lang.String ignoreExternalLinksMode,
URLFilters filters,
URLExemptionFilters exemptionFilters,
URLNormalizers normalizers,
java.lang.String urlNormalizerScope) |
Copyright © 2018 The Apache Software Foundation