java.lang.Object
  org.apache.mahout.df.mapred.Builder
    org.apache.mahout.df.mapred.partial.PartialBuilder
public class PartialBuilder extends Builder
Builds a random forest using partial data. Each mapper uses only the data given by its InputSplit.
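As a rough illustration of the partial-data idea described above, the following sketch cuts a dataset into contiguous ranges, one per mapper, and shares the requested number of trees among them. It is a self-contained stand-in, not Mahout code: the class `PartialSplitSketch`, its methods, and the rule for distributing leftover instances and trees are assumptions made for this example only.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative stand-in for the partial-data idea; not part of Mahout. */
final class PartialSplitSketch {

    /**
     * Cuts the index range [0, nbInstances) into nbSplits contiguous ranges.
     * Each returned array is {start, end} with end exclusive.
     * Assumption: leftover instances go to the first splits, one each.
     */
    static List<int[]> splitRanges(int nbInstances, int nbSplits) {
        List<int[]> ranges = new ArrayList<>();
        int base = nbInstances / nbSplits;
        int extra = nbInstances % nbSplits;
        int start = 0;
        for (int s = 0; s < nbSplits; s++) {
            int size = base + (s < extra ? 1 : 0);
            ranges.add(new int[] {start, start + size});
            start += size;
        }
        return ranges;
    }

    /**
     * Shares nbTrees among nbSplits mappers.
     * Assumption: leftover trees go to the first mappers, one each.
     */
    static int[] treesPerSplit(int nbTrees, int nbSplits) {
        int[] trees = new int[nbSplits];
        for (int s = 0; s < nbSplits; s++) {
            trees[s] = nbTrees / nbSplits + (s < nbTrees % nbSplits ? 1 : 0);
        }
        return trees;
    }

    public static void main(String[] args) {
        List<int[]> ranges = splitRanges(10, 3);
        int[] trees = treesPerSplit(7, 3);
        for (int s = 0; s < ranges.size(); s++) {
            // Each simulated mapper sees only its own range of the data.
            System.out.println("mapper " + s + ": instances [" + ranges.get(s)[0]
                    + ", " + ranges.get(s)[1] + "), trees to grow: " + trees[s]);
        }
    }
}
```

In this sketch every tree is grown from a single range, so no mapper needs the whole dataset in memory. How the real builder assigns trees and instances to mappers is not stated on this page.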
Constructor Summary |
---|
PartialBuilder(TreeBuilder treeBuilder, org.apache.hadoop.fs.Path dataPath, org.apache.hadoop.fs.Path datasetPath, java.lang.Long seed) |
PartialBuilder(TreeBuilder treeBuilder, org.apache.hadoop.fs.Path dataPath, org.apache.hadoop.fs.Path datasetPath, java.lang.Long seed, org.apache.hadoop.conf.Configuration conf) |
Method Summary | |
---|---|
protected void | configureJob(org.apache.hadoop.mapred.JobConf job, int nbTrees, boolean oobEstimate) Used by the inheriting classes to configure the job. |
protected static boolean | isStep2(org.apache.hadoop.conf.Configuration conf) Indicates whether the second step of the builder should run. This parameter is meant only for debugging, so it is kept protected. |
protected DecisionForest | parseOutput(org.apache.hadoop.mapred.JobConf job, PredictionCallback callback) Parses the output files to extract the trees and passes the predictions to the callback. |
protected static void | processOutput(org.apache.hadoop.mapred.JobConf job, org.apache.hadoop.fs.Path outputPath, int[] firstIds, TreeID[] keys, Node[] trees, PredictionCallback callback) Processes the output from the output path. |
protected static void | setStep2(org.apache.hadoop.conf.Configuration conf, boolean value) Sets whether the second step of the builder should run. |
Methods inherited from class org.apache.mahout.df.mapred.Builder |
---|
build, getConf, getDataPath, getDatasetPath, getDistributedCacheFile, getNbTrees, getOutputPath, getRandomSeed, getSeed, getTreeBuilder, getTreeBuilder, isOobEstimate, isOutput, loadDataset, runJob, setNbTrees, setOutputDirName, sortSplits |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public PartialBuilder(TreeBuilder treeBuilder, org.apache.hadoop.fs.Path dataPath, org.apache.hadoop.fs.Path datasetPath, java.lang.Long seed)
public PartialBuilder(TreeBuilder treeBuilder, org.apache.hadoop.fs.Path dataPath, org.apache.hadoop.fs.Path datasetPath, java.lang.Long seed, org.apache.hadoop.conf.Configuration conf)
Method Detail |
---|
protected static boolean isStep2(org.apache.hadoop.conf.Configuration conf)
Parameters:
conf -
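The isStep2 and setStep2 pair reads like a boolean debugging flag kept in the job configuration. A minimal sketch of that pattern follows, using `java.util.Properties` as a stand-in for the Hadoop `Configuration`. The class name, the key name, and the default of `true` when the flag is unset are all assumptions for illustration; this page does not state them.

```java
import java.util.Properties;

/** Illustrative stand-in for a configuration-backed debug flag; not Mahout code. */
final class Step2FlagSketch {

    // Hypothetical key, chosen for this sketch only.
    static final String STEP2_KEY = "sketch.partial.step2";

    /** Records whether the second step should run. */
    static void setStep2(Properties conf, boolean value) {
        conf.setProperty(STEP2_KEY, Boolean.toString(value));
    }

    /** Reads the flag; assumed to default to true when it was never set. */
    static boolean isStep2(Properties conf) {
        return Boolean.parseBoolean(conf.getProperty(STEP2_KEY, "true"));
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        System.out.println(isStep2(conf)); // true (the assumed default)
        setStep2(conf, false);
        System.out.println(isStep2(conf)); // false
    }
}
```

Keeping the flag in the configuration rather than in a field means it travels with the job, which is one plausible reason for a static accessor pair that takes the configuration as an argument.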
protected static void setStep2(org.apache.hadoop.conf.Configuration conf, boolean value)
Parameters:
conf -
value - true to indicate that the second step will be launched
protected void configureJob(org.apache.hadoop.mapred.JobConf job, int nbTrees, boolean oobEstimate) throws java.io.IOException
configureJob in class Builder
Parameters:
nbTrees - number of trees to grow
oobEstimate - true, if oob error should be estimated
Throws:
java.io.IOException
protected DecisionForest parseOutput(org.apache.hadoop.mapred.JobConf job, PredictionCallback callback) throws java.io.IOException
parseOutput in class Builder
Parameters:
callback - can be null
Throws:
java.io.IOException
protected static void processOutput(org.apache.hadoop.mapred.JobConf job, org.apache.hadoop.fs.Path outputPath, int[] firstIds, TreeID[] keys, Node[] trees, PredictionCallback callback) throws java.io.IOException
Parameters:
job -
outputPath - directory that contains the output of the job
firstIds - partitions' first ids in Hadoop's order
keys -
callback - can be null
Throws:
java.io.IOException
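One plausible reading of the firstIds parameter is that each entry holds the global index of the first instance in a partition, so that an index local to a partition can be mapped back to the full dataset. The sketch below computes such an array as a running sum of partition sizes. The class `FirstIdsSketch`, both method names, and this interpretation itself are assumptions; the page does not say how the real array is built.

```java
/** Illustrative stand-in for per-partition first ids; not Mahout code. */
final class FirstIdsSketch {

    /**
     * Returns, for each partition, the global index of its first instance,
     * assuming partitions are laid out back to back in the given order.
     */
    static int[] firstIds(int[] partitionSizes) {
        int[] ids = new int[partitionSizes.length];
        int next = 0;
        for (int p = 0; p < partitionSizes.length; p++) {
            ids[p] = next;
            next += partitionSizes[p];
        }
        return ids;
    }

    /** Maps an index local to a partition to its global index. */
    static int globalId(int[] firstIds, int partition, int localIndex) {
        return firstIds[partition] + localIndex;
    }

    public static void main(String[] args) {
        int[] ids = firstIds(new int[] {4, 3, 3});
        System.out.println(java.util.Arrays.toString(ids)); // [0, 4, 7]
        System.out.println(globalId(ids, 1, 2));            // 6
    }
}
```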