org.apache.mahout.text
Classes 
SparseVectorsFromSequenceFiles
TextParagraphSplittingJob
TextParagraphSplittingJob.SplitMap