org.apache.nutch.indexer
Class DeleteDuplicates.InputFormat

java.lang.Object
  extended byorg.apache.hadoop.mapred.InputFormatBase
      extended byorg.apache.nutch.indexer.DeleteDuplicates.InputFormat
All Implemented Interfaces:
InputFormat
Enclosing class:
DeleteDuplicates

public static class DeleteDuplicates.InputFormat
extends InputFormatBase


Field Summary
 
Fields inherited from class org.apache.hadoop.mapred.InputFormatBase
LOG
 
Constructor Summary
DeleteDuplicates.InputFormat()
           
 
Method Summary
 RecordReader getRecordReader(FileSystem fs, FileSplit split, JobConf job, Reporter reporter)
          Return each index as a split.
 FileSplit[] getSplits(FileSystem fs, JobConf job, int numSplits)
          Return each index as a split.
 
Methods inherited from class org.apache.hadoop.mapred.InputFormatBase
areValidInputDirectories, listFiles, listPaths, setMinSplitSize
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

DeleteDuplicates.InputFormat

public DeleteDuplicates.InputFormat()
Method Detail

getSplits

public FileSplit[] getSplits(FileSystem fs,
                             JobConf job,
                             int numSplits)
                      throws IOException
Return each index as a split.

Throws:
IOException

getRecordReader

public RecordReader getRecordReader(FileSystem fs,
                                    FileSplit split,
                                    JobConf job,
                                    Reporter reporter)
                             throws IOException
Return each index as a split.

Throws:
IOException


Copyright © 2006 The Apache Software Foundation