Vectors
Attribute type handling:
Numeric -> As is
Nominal -> ordinal(value) i.e.
SequenceFile format.
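The attribute type handling listed above (numeric values kept as is, nominal values replaced by their ordinal) can be illustrated with a minimal sketch. The class `AttributeEncoder` and its methods are hypothetical names introduced for illustration only and are not part of the library; the sketch also assumes that the ordinal of a nominal value is its zero-based position in the declared list of labels.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper (not a library class): turns attribute values into doubles.
final class AttributeEncoder {
    // Each declared nominal label mapped to its zero-based position (assumed ordinal).
    private final Map<String, Integer> ordinals = new LinkedHashMap<>();

    AttributeEncoder(List<String> nominalLabels) {
        for (String label : nominalLabels) {
            // Duplicate labels keep the ordinal of their first occurrence.
            ordinals.putIfAbsent(label, ordinals.size());
        }
    }

    // Numeric -> as is: the parsed value is used unchanged.
    static double numeric(String raw) {
        return Double.parseDouble(raw.trim());
    }

    // Nominal -> ordinal(value): the label is replaced by its position.
    double nominal(String label) {
        Integer ordinal = ordinals.get(label);
        if (ordinal == null) {
            throw new IllegalArgumentException("Unknown nominal value: " + label);
        }
        return ordinal.doubleValue();
    }

    public static void main(String[] args) {
        AttributeEncoder colour = new AttributeEncoder(List.of("red", "green", "blue"));
        System.out.println(numeric("3.5"));
        System.out.println(colour.nominal("green"));
    }
}
```

Rejecting unknown labels with an exception is a design choice of this sketch; a real converter might instead skip the value or assign a new ordinal.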
StringTuples.
The SequenceFile input should have a Text key containing the unique document identifier and a Text value containing the whole document.
DictionaryVectorizer job
Writer
SequenceFileVectorIterable.SeqFileIterator.next() is also valid
SequenceFile generated by DictionaryVectorizer
Vector.
Map.
RandomAccessSparseVectors into the complete Document RandomAccessSparseVector
SequenceFile format.
Vectors.
Weight based on term frequency only
StringTuple
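A weight based on term frequency only can be sketched as follows. The class `TermFrequencyWeight` and its methods are hypothetical names for illustration and do not reproduce the library's own weight class; the sketch assumes the document has already been tokenized into a list of strings.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration (not a library class) of a term-frequency-only weight.
final class TermFrequencyWeight {
    // Counts how often each token occurs in one tokenized document.
    static Map<String, Integer> termFrequencies(List<String> tokens) {
        Map<String, Integer> counts = new HashMap<>();
        for (String token : tokens) {
            counts.merge(token, 1, Integer::sum);
        }
        return counts;
    }

    // The weight is the raw term frequency; document frequency,
    // document length and corpus size play no part.
    static double weight(int termFrequency) {
        return termFrequency;
    }

    public static void main(String[] args) {
        Map<String, Integer> tf = termFrequencies(List.of("to", "be", "or", "not", "to", "be"));
        tf.forEach((term, count) -> System.out.println(term + " -> " + weight(count)));
    }
}
```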
The input documents have to be in the SequenceFile format
SequenceFile of Vectors and dump out the results using Vector.asFormatString() to either the console or to a file.
maxDocs to the output.