Uses of Interface
net.nutch.parse.HtmlParseFilter

Packages that use HtmlParseFilter
net.nutch.analysis.lang Text document language identifier. 
org.creativecommons.nutch Sample plugins that parse and index Creative Commons metadata. 
 

Uses of HtmlParseFilter in net.nutch.analysis.lang
 

Classes in net.nutch.analysis.lang that implement HtmlParseFilter
 class HTMLLanguageParser
          Adds metadata identifying the language of the document, if found. We could also run statistical analysis here, but we'd miss all other formats.
 

Uses of HtmlParseFilter in org.creativecommons.nutch
 

Classes in org.creativecommons.nutch that implement HtmlParseFilter
 class CCParseFilter
          Adds metadata identifying the Creative Commons license used, if any.
 



Copyright © 2005 The Nutch Organization.