Package org.apache.mahout.utils.vectors.text

Class Summary
DictionaryVectorizer This class converts a set of input documents in the sequence file format to vectors.
DocumentProcessor This class converts a set of input documents in the sequence file format of StringTuples.The SequenceFile input should have a Text key containing the unique document identifier and a Text value containing the whole document.
 



Copyright © 2008-2010 The Apache Software Foundation. All Rights Reserved.