org.apache.mahout.classifier.bayes.mapreduce.common
Class BayesTfIdfMapper
java.lang.Object
org.apache.hadoop.mapred.MapReduceBase
org.apache.mahout.classifier.bayes.mapreduce.common.BayesTfIdfMapper
- All Implemented Interfaces:
- Closeable, org.apache.hadoop.mapred.JobConfigurable, org.apache.hadoop.mapred.Mapper<StringTuple,org.apache.hadoop.io.DoubleWritable,StringTuple,org.apache.hadoop.io.DoubleWritable>
public class BayesTfIdfMapper
- extends org.apache.hadoop.mapred.MapReduceBase
- implements org.apache.hadoop.mapred.Mapper<StringTuple,org.apache.hadoop.io.DoubleWritable,StringTuple,org.apache.hadoop.io.DoubleWritable>
Naive Bayes Tf-Idf Mapper. Calculates per-document statistics.
Method Summary
void configure(org.apache.hadoop.mapred.JobConf job)
void map(StringTuple key, org.apache.hadoop.io.DoubleWritable value, org.apache.hadoop.mapred.OutputCollector<StringTuple,org.apache.hadoop.io.DoubleWritable> output, org.apache.hadoop.mapred.Reporter reporter)
- Calculates the Tf-Idf of each feature in each label.
Methods inherited from class org.apache.hadoop.mapred.MapReduceBase
- close
Methods inherited from class java.lang.Object
- clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
BayesTfIdfMapper
public BayesTfIdfMapper()
map
public void map(StringTuple key,
org.apache.hadoop.io.DoubleWritable value,
org.apache.hadoop.mapred.OutputCollector<StringTuple,org.apache.hadoop.io.DoubleWritable> output,
org.apache.hadoop.mapred.Reporter reporter)
throws IOException
- Calculates the Tf-Idf of each feature in each label.
- Specified by:
map
in interface org.apache.hadoop.mapred.Mapper<StringTuple,org.apache.hadoop.io.DoubleWritable,StringTuple,org.apache.hadoop.io.DoubleWritable>
- Parameters:
key
- The (label, feature) pair (can be either the frequency count or the term document count)
- Throws:
IOException
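As an illustration of the Tf-Idf weighting that map is described as calculating, here is a minimal standalone sketch. It is not the Mahout implementation: it assumes the common tf * log(N / df) form, and the class TfIdfSketch, its method names, and the sample numbers are hypothetical. It uses no Hadoop or Mahout classes, so the StringTuple key is stood in for by plain method arguments.

```java
/**
 * Hypothetical, standalone illustration of Tf-Idf weighting.
 * Assumption: weight = tf * log(N / df), where N is the number of
 * documents in a label and df is the number of those documents that
 * contain the feature. This is a sketch, not Mahout's actual code.
 */
final class TfIdfSketch {

    private TfIdfSketch() {
        // static helpers only
    }

    /** Inverse document frequency: log(labelDocumentCount / termDocumentCount). */
    public static double idf(double labelDocumentCount, double termDocumentCount) {
        if (labelDocumentCount <= 0.0 || termDocumentCount <= 0.0) {
            throw new IllegalArgumentException("counts must be positive");
        }
        return Math.log(labelDocumentCount / termDocumentCount);
    }

    /** Tf-Idf weight of one (label, feature) pair under the assumed formula. */
    public static double tfIdf(double termFrequency,
                               double labelDocumentCount,
                               double termDocumentCount) {
        return termFrequency * idf(labelDocumentCount, termDocumentCount);
    }

    public static void main(String[] args) {
        // Made-up sample values for a single (label, feature) pair.
        double termFrequency = 3.0;        // frequency count of the feature in the label
        double termDocumentCount = 10.0;   // documents in the label containing the feature
        double labelDocumentCount = 100.0; // total documents in the label

        double weight = tfIdf(termFrequency, labelDocumentCount, termDocumentCount);
        System.out.println("tf-idf weight = " + weight);
    }
}
```

Under this assumed formula, a feature that occurs in every document of a label gets an idf of zero, so its weight vanishes regardless of its frequency count.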
configure
public void configure(org.apache.hadoop.mapred.JobConf job)
- Specified by:
configure
in interface org.apache.hadoop.mapred.JobConfigurable
- Overrides:
configure
in class org.apache.hadoop.mapred.MapReduceBase
Copyright © 2008-2012 The Apache Software Foundation. All Rights Reserved.