Package org.apache.mahout.vectorizer.encoders

Class Summary
AdaptiveWordValueEncoder Encodes words into vectors in the same way as WordValueEncoder, while maintaining an adaptive dictionary of the values seen so far.
CachingContinuousValueEncoder  
CachingStaticWordValueEncoder  
CachingTextValueEncoder  
ConstantValueEncoder An encoder that records a constant value, as is conventional for a virtual bias term.
ContinuousValueEncoder Continuous values are stored in a fixed, randomized location in the feature vector.
Dictionary Assigns integer codes to strings as they appear.
FeatureVectorEncoder General interface for objects that record features into a feature vector.
InteractionValueEncoder  
LuceneTextValueEncoder Encodes text using a Lucene-style tokenizer.
MurmurHash This is a very fast, non-cryptographic hash suitable for general hash-based lookup.
StaticWordValueEncoder Encodes a categorical value with an unbounded vocabulary.
TextValueEncoder Encodes text that is tokenized on non-alphanumeric separators.
WordValueEncoder Encodes words as sparse vector updates to a Vector.
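
The summaries above describe two ideas: hashing a feature into a fixed location of a vector, and assigning integer codes to strings as they appear. The following is a minimal, self-contained sketch of both ideas. It does not use the Mahout classes or their signatures: HashedEncodingSketch, SimpleDictionary, addWord and addContinuous are hypothetical names, a plain double[] stands in for a Vector, and FNV-1a stands in for MurmurHash.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative only: hypothetical names, not the Mahout API.
public class HashedEncodingSketch {

    /** Assigns integer codes to strings in order of first appearance. */
    static final class SimpleDictionary {
        private final Map<String, Integer> codes = new LinkedHashMap<>();

        int intern(String s) {
            Integer code = codes.get(s);
            if (code == null) {
                code = codes.size();   // next unused code
                codes.put(s, code);
            }
            return code;
        }

        int size() {
            return codes.size();
        }
    }

    /** Maps (feature name, value) to an index in [0, numFeatures). FNV-1a is an assumed stand-in hash. */
    static int hash(String name, String value, int numFeatures) {
        int h = 0x811C9DC5;
        for (byte b : (name + ":" + value).getBytes(StandardCharsets.UTF_8)) {
            h ^= (b & 0xff);
            h *= 0x01000193;           // int overflow is intended here
        }
        return Math.floorMod(h, numFeatures);
    }

    /** Word-style encoding: the location depends on both the feature name and the word. */
    static void addWord(String name, String word, double weight, double[] vector) {
        vector[hash(name, word, vector.length)] += weight;
    }

    /** Continuous-style encoding: the location depends only on the name; the value is added there. */
    static void addContinuous(String name, double value, double[] vector) {
        vector[hash(name, "", vector.length)] += value;
    }

    public static void main(String[] args) {
        double[] v = new double[16];
        addWord("color", "red", 1.0, v);
        addWord("color", "red", 1.0, v);   // same word accumulates at the same index
        addContinuous("price", 3.5, v);
        System.out.println(java.util.Arrays.toString(v));
    }
}
```

Because locations are chosen by hashing, the vector size stays fixed however large the vocabulary grows, at the cost of occasional collisions between unrelated features.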
 



Copyright © 2008-2010 The Apache Software Foundation. All Rights Reserved.