public static class ArcSegmentCreator.ArcSegmentCreatorMapper extends Mapper<Text,BytesWritable,Text,NutchWritable>
Mapper.Context
Modifier and Type | Field and Description |
---|---|
static java.lang.String |
URL_VERSION |
Constructor and Description |
---|
ArcSegmentCreatorMapper() |
Modifier and Type | Method and Description |
---|---|
void |
map(Text key,
BytesWritable bytes,
Mapper.Context context)
Runs the Map job to translate an arc record into output for Nutch segments.
|
void |
setup(Mapper.Context context)
Configures the job mapper.
|
public static final java.lang.String URL_VERSION
public void setup(Mapper.Context context)
Configures the job mapper. Sets the url filters, scoring filters, url normalizers and other relevant data.
setup
in class Mapper<Text,BytesWritable,Text,NutchWritable>
job
- The job configuration.public void map(Text key, BytesWritable bytes, Mapper.Context context) throws java.io.IOException, java.lang.InterruptedException
Runs the Map job to translate an arc record into output for Nutch segments.
map
in class Mapper<Text,BytesWritable,Text,NutchWritable>
key
- The arc record header.bytes
- The arc record raw content bytes.context
- The context of the mapreduce job.java.io.IOException
java.lang.InterruptedException
Copyright © 2018 The Apache Software Foundation