org.apache.nutch.net
Interface UrlNormalizer

All Superinterfaces:
Configurable
All Known Implementing Classes:
BasicUrlNormalizer, RegexUrlNormalizer

public interface UrlNormalizer
extends Configurable

Interface used to convert URLs to normal form and optionally do regex substitutions


Method Summary
 String normalize(String urlString)
           
 
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf
 

Method Detail

normalize

public String normalize(String urlString)
                 throws MalformedURLException
Throws:
MalformedURLException


Copyright © 2006 The Apache Software Foundation