public static class Fetcher.InputFormat extends SequenceFileInputFormat<Text,CrawlDatum>
FileInputFormat.Counter
DEFAULT_LIST_STATUS_NUM_THREADS, INPUT_DIR, INPUT_DIR_RECURSIVE, LIST_STATUS_NUM_THREADS, NUM_INPUT_FILES, PATHFILTER_CLASS, SPLIT_MAXSIZE, SPLIT_MINSIZE
Constructor and Description |
---|
InputFormat() |
Modifier and Type | Method and Description |
---|---|
InputSplit[] |
getSplits(JobContext job,
int nSplits)
Don't split inputs, to keep things polite.
|
createRecordReader, getFormatMinSplitSize, listStatus
addInputPath, addInputPathRecursively, addInputPaths, computeSplitSize, getBlockIndex, getInputDirRecursive, getInputPathFilter, getInputPaths, getMaxSplitSize, getMinSplitSize, getSplits, isSplitable, makeSplit, makeSplit, setInputDirRecursive, setInputPathFilter, setInputPaths, setInputPaths, setMaxInputSplitSize, setMinInputSplitSize
public InputSplit[] getSplits(JobContext job, int nSplits) throws java.io.IOException
java.io.IOException
Copyright © 2018 The Apache Software Foundation