|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
Class Summary | |
---|---|
SequenceFilesFromDirectory | Converts a directory of text documents into SequenceFiles of Specified chunkSize. |
SequenceFilesFromDirectory.ChunkedWriter | |
WikipediaMapper | Maps over Wikipedia xml format and output all document having the category listed in the input category file |
WikipediaToSequenceFile | Create and run the Wikipedia Dataset Creator. |
|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |