org.apache.mahout.df.tools
Class FrequenciesJob
java.lang.Object
org.apache.mahout.df.tools.FrequenciesJob
public class FrequenciesJob
- extends Object
Temporary class used to compute the frequency distribution of the "class attribute".
Constructor Summary
FrequenciesJob(org.apache.hadoop.fs.Path base, org.apache.hadoop.fs.Path dataPath, org.apache.hadoop.fs.Path datasetPath)
Method Summary
protected int[][] parseOutput(org.apache.hadoop.mapreduce.JobContext job)
- Extracts the output and processes it
int[][] run(org.apache.hadoop.conf.Configuration conf)
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
FrequenciesJob
public FrequenciesJob(org.apache.hadoop.fs.Path base,
org.apache.hadoop.fs.Path dataPath,
org.apache.hadoop.fs.Path datasetPath)
- Parameters:
base - base directory
dataPath - data used in the first step
run
public int[][] run(org.apache.hadoop.conf.Configuration conf)
throws IOException,
ClassNotFoundException,
InterruptedException
- Returns:
- counts[partition][label] = number of tuples in 'partition' whose class is 'label'
- Throws:
IOException
ClassNotFoundException
InterruptedException
parseOutput
protected int[][] parseOutput(org.apache.hadoop.mapreduce.JobContext job)
throws IOException
- Extracts the output and processes it
- Returns:
- counts[partition][label] = number of tuples in 'partition' whose class is 'label'
- Throws:
IOException
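The counts[partition][label] layout returned by run and parseOutput can be read as one row per partition and one column per class label. The following is a minimal sketch of how such a matrix might be consumed. It does not call Mahout or Hadoop: the class FrequencyCounts and its methods labelTotals and distribution are hypothetical helpers introduced only for illustration, the sample matrix is made up, and the code assumes every row has the same number of labels.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Mahout: interprets a
// counts[partition][label] matrix shaped like the documented return value.
class FrequencyCounts {

  // Sums each label's count over all partitions.
  // Assumes a rectangular matrix (every partition row has the same length).
  static int[] labelTotals(int[][] counts) {
    int numLabels = counts.length == 0 ? 0 : counts[0].length;
    int[] totals = new int[numLabels];
    for (int[] partition : counts) {
      for (int label = 0; label < numLabels; label++) {
        totals[label] += partition[label];
      }
    }
    return totals;
  }

  // Relative frequency of each label inside a single partition.
  // Returns all zeros for an empty partition to avoid dividing by zero.
  static double[] distribution(int[] partitionCounts) {
    long sum = 0;
    for (int c : partitionCounts) {
      sum += c;
    }
    double[] dist = new double[partitionCounts.length];
    if (sum == 0) {
      return dist;
    }
    for (int label = 0; label < partitionCounts.length; label++) {
      dist[label] = partitionCounts[label] / (double) sum;
    }
    return dist;
  }

  public static void main(String[] args) {
    // Made-up sample: 2 partitions, 2 class labels.
    int[][] counts = { {3, 1}, {2, 4} };
    System.out.println(Arrays.toString(labelTotals(counts)));      // prints [5, 5]
    System.out.println(Arrays.toString(distribution(counts[0])));  // prints [0.75, 0.25]
  }
}
```

In the sample, partition 0 holds three tuples of label 0 and one of label 1, so its distribution is skewed even though the labels are balanced across the whole dataset.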
Copyright © 2008-2011 The Apache Software Foundation. All Rights Reserved.