Package org.apache.mahout.clustering.lda

Class Summary
LDADriver Estimates an LDA model from a corpus of documents, which are SparseVectors of word counts.
LDAInference Class for performing inference on a document, which involves computing (an approximation to) p(word|topic) for each word and topic, and a prior distribution p(topic) for each topic.
LDAInference.InferredDocument An estimate of the probabilities for each document.
LDAMapper Runs inference on the input documents (which are sparse vectors of word counts) and outputs the sufficient statistics for the word-topic assignments.
LDAReducer A simple reducer that combines the input doubles for each key: sufficient statistics are combined with a log-sum and emitted as a new double, while log likelihoods are summed directly.
LDAState  
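
The log-sum mentioned for LDAReducer can be illustrated with a small standalone sketch. It assumes the sufficient statistics are stored as logarithms, so combining them means computing log(exp(a) + exp(b) + ...) without leaving log space. The class name LogSumSketch and the method logSum are hypothetical names chosen for this illustration; they are not taken from the Mahout source, and the actual reducer may compute the value differently.

```java
public final class LogSumSketch {
    private LogSumSketch() {}

    /**
     * Returns log(sum_i exp(logValues[i])).
     * Subtracting the maximum before exponentiating keeps the
     * intermediate values in a range where exp() does not overflow.
     * Hypothetical helper for illustration, not Mahout's API.
     */
    public static double logSum(double[] logValues) {
        double max = Double.NEGATIVE_INFINITY;
        for (double v : logValues) {
            max = Math.max(max, v);
        }
        // Empty input, or every term is log(0): the sum is zero.
        if (max == Double.NEGATIVE_INFINITY) {
            return Double.NEGATIVE_INFINITY;
        }
        double sum = 0.0;
        for (double v : logValues) {
            sum += Math.exp(v - max);
        }
        return max + Math.log(sum);
    }

    public static void main(String[] args) {
        // Two partial statistics, 1.0 and 3.0, held in log space.
        double[] logCounts = {Math.log(1.0), Math.log(3.0)};
        // Exponentiating the result recovers a value close to 4.0.
        System.out.println(Math.exp(logSum(logCounts)));
    }
}
```

Log likelihoods, by contrast, are already additive, which is why a plain sum is enough for them.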
 



Copyright © 2008-2010 The Apache Software Foundation. All Rights Reserved.