Package org.apache.mahout.utils.nlp.collocations.llr

Interface Summary
LLRReducer.LLCallback provide interface so the input to the llr calculation can be captured for validation in unit testing
 

Class Summary
CollocCombiner Combiner for pass1 of the CollocationDriver.
CollocDriver Driver for LLR Collocation discovery mapreduce job
CollocMapper Pass 1 of the Collocation discovery job which generated ngrams and emits ngrams an their component n-1grams.
CollocMapper.IteratorTokenStream Used to emit tokens from an input string array in the style of TokenStream
CollocReducer Reducer for Pass 1 of the collocation identification job.
Gram Writable for holding data generated from the collocation discovery jobs.
GramKey A GramKey, based on the identity fields of Gram (type, string) plus a byte[] used for secondary ordering
GramKeyGroupComparator Group GramKeys based on their Gram, ignoring the secondary sort key, so that all keys with the same Gram are sent to the same call of the reduce method, sorted in natural order (for GramKeys).
GramKeyPartitioner Partition GramKeys based on their Gram, ignoring the secondary sort key so that all GramKeys with the same gram are sent to the same partition.
LLRReducer Reducer for pass 2 of the collocation discovery job.
LLRReducer.ConcreteLLCallback concrete implementation delegates to LogLikelihood class
 

Enum Summary
CollocMapper.Count  
CollocReducer.Skipped  
Gram.Type  
LLRReducer.Skipped Counter to track why a particlar entry was skipped
 



Copyright © 2008-2010 The Apache Software Foundation. All Rights Reserved.