org.apache.mahout.utils.vectors.text.term
Class TermCountMapper

java.lang.Object
  extended by org.apache.hadoop.mapred.MapReduceBase
      extended by org.apache.mahout.utils.vectors.text.term.TermCountMapper
All Implemented Interfaces:
java.io.Closeable, org.apache.hadoop.mapred.JobConfigurable, org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.Text,StringTuple,org.apache.hadoop.io.Text,org.apache.hadoop.io.LongWritable>

public class TermCountMapper
extends org.apache.hadoop.mapred.MapReduceBase
implements org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.Text,StringTuple,org.apache.hadoop.io.Text,org.apache.hadoop.io.LongWritable>

TextVectorizer Term Count Mapper. Tokenizes a text document and outputs the count of the words


Constructor Summary
TermCountMapper()
           
 
Method Summary
 void map(org.apache.hadoop.io.Text key, StringTuple value, org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,org.apache.hadoop.io.LongWritable> output, org.apache.hadoop.mapred.Reporter reporter)
           
 
Methods inherited from class org.apache.hadoop.mapred.MapReduceBase
close, configure
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.hadoop.mapred.JobConfigurable
configure
 
Methods inherited from interface java.io.Closeable
close
 

Constructor Detail

TermCountMapper

public TermCountMapper()
Method Detail

map

public void map(org.apache.hadoop.io.Text key,
                StringTuple value,
                org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,org.apache.hadoop.io.LongWritable> output,
                org.apache.hadoop.mapred.Reporter reporter)
         throws java.io.IOException
Specified by:
map in interface org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.Text,StringTuple,org.apache.hadoop.io.Text,org.apache.hadoop.io.LongWritable>
Throws:
java.io.IOException


Copyright © 2008-2010 The Apache Software Foundation. All Rights Reserved.