Package org.apache.mahout.clustering.lda

Class Summary
LDADocumentTopicMapper  
LDADriver Estimates an LDA model from a corpus of documents, which are SparseVectors of word counts.
LDAInference Class for performing infererence on a document, which involves computing (an approximation to) p(word|topic) for each word and topic, and a prior distribution p(topic) for each topic.
LDAInference.InferredDocument An estimate of the probabilities for each document.
LDAReducer A very simple reducer which simply logSums the input doubles and outputs a new double for sufficient statistics, and sums log likelihoods.
LDAState  
LDAWordTopicMapper Runs inference on the input documents (which are sparse vectors of word counts) and outputs the sufficient statistics for the word-topic assignments.
 



Copyright © 2008-2011 The Apache Software Foundation. All Rights Reserved.