org.apache.mahout.clustering.syntheticcontrol.meanshift
Class Job

java.lang.Object
  extended by org.apache.hadoop.conf.Configured
      extended by org.apache.mahout.common.AbstractJob
          extended by org.apache.mahout.clustering.syntheticcontrol.meanshift.Job
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool

public final class Job
extends AbstractJob


Method Summary
static void main(String[] args)
           
 void run(org.apache.hadoop.conf.Configuration conf, org.apache.hadoop.fs.Path input, org.apache.hadoop.fs.Path output, DistanceMeasure measure, double t1, double t2, double convergenceDelta, int maxIterations)
          Run the meanshift clustering job on an input dataset using the given distance measure, t1, t2 and iteration parameters.
 int run(String[] args)
           
 
Methods inherited from class org.apache.mahout.common.AbstractJob
addFlag, addInputOption, addOption, addOption, addOption, addOption, addOutputOption, getInputPath, getOption, getOutputPath, hasOption, keyFor, maybePut, parseArguments, parseDirectories, prepareJob, shouldRunNextPhase
 
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf
 

Method Detail

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception

run

public int run(String[] args)
        throws IOException,
               ClassNotFoundException,
               InterruptedException,
               InstantiationException,
               IllegalAccessException
Throws:
IOException
ClassNotFoundException
InterruptedException
InstantiationException
IllegalAccessException

run

public void run(org.apache.hadoop.conf.Configuration conf,
                org.apache.hadoop.fs.Path input,
                org.apache.hadoop.fs.Path output,
                DistanceMeasure measure,
                double t1,
                double t2,
                double convergenceDelta,
                int maxIterations)
         throws IOException,
                InterruptedException,
                ClassNotFoundException,
                InstantiationException,
                IllegalAccessException
Run the meanshift clustering job on an input dataset using the given distance measure, t1, t2 and iteration parameters. All output data will be written to the output directory, which will be initially deleted if it exists. The clustered points will reside in the path /clustered-points. By default, the job expects the a file containing synthetic_control.data as obtained from http://archive.ics.uci.edu/ml/datasets/Synthetic+Control+Chart+Time+Series resides in a directory named "testdata", and writes output to a directory named "output".

Parameters:
input - the String denoting the input directory path
output - the String denoting the output directory path
measure - the DistanceMeasure to use
t1 - the meanshift canopy T1 threshold
t2 - the meanshift canopy T2 threshold
convergenceDelta - the double convergence criteria for iterations
maxIterations - the int maximum number of iterations
Throws:
IOException
InterruptedException
ClassNotFoundException
InstantiationException
IllegalAccessException


Copyright © 2008-2010 The Apache Software Foundation. All Rights Reserved.