java.lang.Object
  org.apache.mahout.vectorizer.encoders.FeatureVectorEncoder
    org.apache.mahout.vectorizer.encoders.TextValueEncoder
      org.apache.mahout.vectorizer.encoders.LuceneTextValueEncoder
public class LuceneTextValueEncoder
extends TextValueEncoder
Encodes text using a Lucene-style tokenizer.
See Also: TextValueEncoder
Field Summary |
---|
Fields inherited from class org.apache.mahout.vectorizer.encoders.FeatureVectorEncoder |
---|
CONTINUOUS_VALUE_HASH_SEED, WORD_LIKE_VALUE_HASH_SEED |
Constructor Summary | |
---|---|
LuceneTextValueEncoder(java.lang.String name) |
Method Summary | |
---|---|
void | setAnalyzer(org.apache.lucene.analysis.Analyzer analyzer) |
protected java.lang.Iterable<java.lang.String> | tokenize(java.lang.CharSequence originalForm) Tokenizes a string using the simplest method. |
Methods inherited from class org.apache.mahout.vectorizer.encoders.TextValueEncoder |
---|
addText, addText, addToVector, asString, flush, hashesForProbe, hashForProbe, setWordEncoder |
Methods inherited from class org.apache.mahout.vectorizer.encoders.FeatureVectorEncoder |
---|
addToVector, addToVector, addToVector, bytesForString, getName, getProbes, getWeight, hash, hash, hash, hash, hash, isTraceEnabled, setProbes, setTraceDictionary, trace, trace |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public LuceneTextValueEncoder(java.lang.String name)
Method Detail |
---|
public void setAnalyzer(org.apache.lucene.analysis.Analyzer analyzer)
protected java.lang.Iterable<java.lang.String> tokenize(java.lang.CharSequence originalForm)
Overrides: tokenize in class TextValueEncoder
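The relationship between setAnalyzer and the overridden tokenize can be illustrated with a minimal, standard-library-only sketch. It is not the Mahout implementation: SimpleTextEncoder, PluggableTextEncoder, EncoderSketch, and lowercaseWords are hypothetical names, a plain Function stands in for the Lucene Analyzer, and String.hashCode stands in for whatever hash the real encoder uses. The sketch only shows the pattern of a base encoder with an overridable tokenize step and a subclass that delegates that step to a pluggable analyzer.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.function.Function;

// Hypothetical stand-in for a text encoder whose tokenize step can be overridden.
class SimpleTextEncoder {
    private final String name;

    SimpleTextEncoder(String name) {
        this.name = name;
    }

    // Default tokenization: split on runs of whitespace.
    protected Iterable<String> tokenize(CharSequence originalForm) {
        List<String> tokens = new ArrayList<>();
        for (String t : originalForm.toString().trim().split("\\s+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Hash each token to one slot of a fixed-size vector and add 1.0 to that slot.
    // (Assumption: String.hashCode is used here purely for illustration.)
    void addToVector(CharSequence text, double[] vector) {
        for (String token : tokenize(text)) {
            int index = Math.floorMod((name + ":" + token).hashCode(), vector.length);
            vector[index] += 1.0;
        }
    }
}

// Hypothetical subclass: tokenization is delegated to a pluggable function,
// which plays the role that an Analyzer plays in the documented class.
class PluggableTextEncoder extends SimpleTextEncoder {
    private Function<CharSequence, List<String>> analyzer;

    PluggableTextEncoder(String name) {
        super(name);
    }

    void setAnalyzer(Function<CharSequence, List<String>> analyzer) {
        this.analyzer = analyzer;
    }

    @Override
    protected Iterable<String> tokenize(CharSequence originalForm) {
        if (analyzer == null) {
            throw new IllegalStateException("setAnalyzer must be called before encoding");
        }
        return analyzer.apply(originalForm);
    }
}

class EncoderSketch {
    // Toy analyzer: lowercase the text, then split on anything that is not a letter or digit.
    static List<String> lowercaseWords(CharSequence text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toString().toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Encode a piece of text into a hashed vector of the given size.
    static double[] encode(String text, int size) {
        PluggableTextEncoder encoder = new PluggableTextEncoder("body");
        encoder.setAnalyzer(EncoderSketch::lowercaseWords);
        double[] vector = new double[size];
        encoder.addToVector(text, vector);
        return vector;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(encode("Hello, hello world!", 8)));
    }
}
```

In this sketch, calling the encoder before an analyzer has been set fails fast with an exception. Whether the real class behaves the same way is not stated on this page, so it is safest to call setAnalyzer before adding any text.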