|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectorg.apache.mahout.clustering.lda.LDADriver
public final class LDADriver
Estimates an LDA model from a corpus of documents, which are SparseVectors of word counts. At each phase, it outputs a matrix of log probabilities of each topic.
Method Summary | |
---|---|
static void |
main(java.lang.String[] args)
|
static double |
runIteration(java.lang.String input,
java.lang.String stateIn,
java.lang.String stateOut,
int numTopics,
int numWords,
double topicSmoothing,
int numReducers)
Run the job using supplied arguments |
static void |
runJob(java.lang.String input,
java.lang.String output,
int numTopics,
int numWords,
double topicSmoothing,
int maxIterations,
int numReducers)
Run the job using supplied arguments |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Method Detail |
---|
public static void main(java.lang.String[] args) throws java.lang.ClassNotFoundException, java.io.IOException, java.lang.InterruptedException
java.lang.ClassNotFoundException
java.io.IOException
java.lang.InterruptedException
public static void runJob(java.lang.String input, java.lang.String output, int numTopics, int numWords, double topicSmoothing, int maxIterations, int numReducers) throws java.io.IOException, java.lang.InterruptedException, java.lang.ClassNotFoundException
input
- the directory pathname for input pointsoutput
- the directory pathname for output pointsnumTopics
- the number of topicsnumWords
- the number of wordstopicSmoothing
- pseudocounts for each topic, typically small < .5maxIterations
- the maximum number of iterationsnumReducers
- the number of Reducers desired
java.io.IOException
java.lang.InterruptedException
java.lang.ClassNotFoundException
public static double runIteration(java.lang.String input, java.lang.String stateIn, java.lang.String stateOut, int numTopics, int numWords, double topicSmoothing, int numReducers) throws java.io.IOException, java.lang.InterruptedException, java.lang.ClassNotFoundException
input
- the directory pathname for input pointsstateIn
- the directory pathname for input statestateOut
- the directory pathname for output statenumTopics
- the number of clustersnumReducers
- the number of Reducers desired
java.io.IOException
java.lang.InterruptedException
java.lang.ClassNotFoundException
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |