A B C D F G I K L M N O P R S T V W

A

accept(Path) - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
accept(File) - Method in class org.apache.mahout.text.SequenceFilesFromMailArchives.PrefixAdditionFilter
 
addDateFormat(Integer, DateFormat) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
addDateFormat(Integer, DateFormat) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
addLabel(String, Integer) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
addLabel(String, Integer) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
addNominal(String, String, int) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
addNominal(String, String, int) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
addOptions() - Method in class org.apache.mahout.text.SequenceFilesFromCsvFilter
 
addOptions() - Method in class org.apache.mahout.text.SequenceFilesFromDirectory
Override this method in order to add additional options to the command line of the SequenceFileFromDirectory job.
addType(Integer, ARFFType) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
addType(Integer, ARFFType) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
ARFF_COMMENT - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
ARFF_SPARSE - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
ARFFModel - Interface in org.apache.mahout.utils.vectors.arff
An interface for representing an ARFFModel.
ARFFType - Enum in org.apache.mahout.utils.vectors.arff
 
ARFFVectorIterable - Class in org.apache.mahout.utils.vectors.arff
Read in ARFF (http://www.cs.waikato.ac.nz/~ml/weka/arff.html) and create Vectors

Attribute type handling: Numeric -> As is Nominal -> ordinal(value) i.e.

ARFFVectorIterable(File, ARFFModel) - Constructor for class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
ARFFVectorIterable(File, Charset, ARFFModel) - Constructor for class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
ARFFVectorIterable(String, ARFFModel) - Constructor for class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
ARFFVectorIterable(Reader, ARFFModel) - Constructor for class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
ATTRIBUTE - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 

B

Bump125 - Class in org.apache.mahout.utils
Helps with making nice intervals at arbitrary scale.
Bump125() - Constructor for class org.apache.mahout.utils.Bump125
 

C

CachedTermInfo - Class in org.apache.mahout.utils.vectors.lucene
Caches TermEntries from a single field.
CachedTermInfo(IndexReader, String, int, int) - Constructor for class org.apache.mahout.utils.vectors.lucene.CachedTermInfo
 
CDbwEvaluator - Class in org.apache.mahout.clustering.cdbw
This class calculates the CDbw metric as defined in http://www.db-net.aueb.gr/index.php/corporate/content/download/227/833/file/HV_poster2002.pdf
CDbwEvaluator(Map<Integer, List<VectorWritable>>, List<Cluster>, DistanceMeasure) - Constructor for class org.apache.mahout.clustering.cdbw.CDbwEvaluator
For testing only
CDbwEvaluator(Configuration, Path) - Constructor for class org.apache.mahout.clustering.cdbw.CDbwEvaluator
Initialize a new instance from job information
charset - Variable in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
CHARSET_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromDirectory
 
CHUNK_SIZE_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromDirectory
 
ChunkedWriter - Class in org.apache.mahout.text
 
ChunkedWriter(Configuration, int, Path) - Constructor for class org.apache.mahout.text.ChunkedWriter
 
cleanup(Mapper<IntWritable, WeightedVectorWritable, IntWritable, WeightedVectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
cleanup(Reducer<IntWritable, WeightedVectorWritable, IntWritable, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
cloneBenchmark() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
close() - Method in class org.apache.mahout.text.ChunkedWriter
 
close() - Method in class org.apache.mahout.text.SequenceFilesFromMailArchives.ChunkedWriter
 
close() - Method in class org.apache.mahout.utils.vectors.io.DelimitedTermInfoWriter
Does NOT close the underlying writer
close() - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
close() - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
closestCentroidBenchmark(DistanceMeasure) - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
ClusterDumper - Class in org.apache.mahout.utils.clustering
 
ClusterDumper(Path, Path) - Constructor for class org.apache.mahout.utils.clustering.ClusterDumper
 
ClusterDumper() - Constructor for class org.apache.mahout.utils.clustering.ClusterDumper
 
ClusterEvaluator - Class in org.apache.mahout.clustering.evaluation
 
ClusterEvaluator(Map<Integer, List<VectorWritable>>, List<Cluster>, DistanceMeasure) - Constructor for class org.apache.mahout.clustering.evaluation.ClusterEvaluator
For testing only
ClusterEvaluator(Configuration, Path) - Constructor for class org.apache.mahout.clustering.evaluation.ClusterEvaluator
Initialize a new instance from job information
ClusterLabels - Class in org.apache.mahout.utils.vectors.lucene
Get labels for the cluster using Log Likelihood Ratio (LLR).
ClusterLabels(Path, Path, String, String, int, int) - Constructor for class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
computeNext() - Method in class org.apache.mahout.utils.vectors.csv.CSVVectorIterator
 
computeNext() - Method in class org.apache.mahout.utils.vectors.lucene.LuceneIterator
 
computeRmse(Path) - Method in class org.apache.mahout.utils.eval.ParallelFactorizationEvaluator
 
conf - Variable in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
configure(Map<Integer, List<VectorWritable>>, DistanceMeasure) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
configure(Map<Integer, List<VectorWritable>>) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
createBenchmark() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
createSequenceFiles(File, String, String, int, Charset) - Method in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
CSVVectorIterator - Class in org.apache.mahout.utils.vectors.csv
Iterates a CSV file and produces Vector.
CSVVectorIterator(Reader) - Constructor for class org.apache.mahout.utils.vectors.csv.CSVVectorIterator
 
CSVVectorIterator(Reader, CSVStrategy) - Constructor for class org.apache.mahout.utils.vectors.csv.CSVVectorIterator
 

D

DATA - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
DatasetSplitter - Class in org.apache.mahout.utils.eval
Split a recommendation dataset into a training and a test set
DatasetSplitter() - Constructor for class org.apache.mahout.utils.eval.DatasetSplitter
 
DEFAULT_MAX_LABELS - Static variable in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
DEFAULT_MIN_IDS - Static variable in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
DelimitedTermInfoWriter - Class in org.apache.mahout.utils.vectors.io
Write TermInfo to a Writer in a textual, delimited format with header.
DelimitedTermInfoWriter(Writer, String, String) - Constructor for class org.apache.mahout.utils.vectors.io.DelimitedTermInfoWriter
 
deserializeBenchmark() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
DICTIONARY_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
DICTIONARY_TYPE_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
DISTANCE_MEASURE_KEY - Static variable in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
distanceMeasureBenchmark(DistanceMeasure) - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
dotBenchmark() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
Driver - Class in org.apache.mahout.utils.vectors.arff
 
Driver - Class in org.apache.mahout.utils.vectors.lucene
 

F

FILE_FILTER_CLASS_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromDirectory
 
fs - Variable in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 

G

getAllEntries() - Method in class org.apache.mahout.utils.vectors.lucene.CachedTermInfo
 
getAllEntries() - Method in interface org.apache.mahout.utils.vectors.TermInfo
 
getARFFType(Integer) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getARFFType(Integer) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getCDbw() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
Compute the CDbw validity metric (eqn 8).
getClusterIdToPoints() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getClusterLabels(Integer, Collection<WeightedVectorWritable>) - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
Get the list of labels, sorted by best score.
getDateFormat(Integer) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getDateFormat(Integer) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getDateMap() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
Map of Date formatters used
getDocFreq() - Method in class org.apache.mahout.utils.vectors.TermEntry
 
getIdField() - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
getIndicator() - Method in enum org.apache.mahout.utils.vectors.arff.ARFFType
 
getLabel(String) - Method in enum org.apache.mahout.utils.vectors.arff.ARFFType
 
getLabelBindings() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
The vector attributes (labels in Mahout speak)
getLabelBindings() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
The vector attributes (labels in Mahout speak), unmodifiable
getLabelIndex(String) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getLabelIndex(String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getLabels() - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
getLabelSize() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getLabelSize() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getMessageCount() - Method in class org.apache.mahout.text.SequenceFilesFromMailArchives.PrefixAdditionFilter
 
getModel() - Method in class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
Returns info about the ARFF content that was parsed.
getNominalMap() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getNominalMap() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
Map nominals to ids.
getNominalValue(String, String) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getNominalValue(String, String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getNumTopFeatures() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getOptions() - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
getOutput() - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
getOutputFile() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getRelation() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getRelation() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
getRepresentativePoints(Configuration) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
getRepresentativePoints(Configuration, Path) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
getSubString() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getTerm() - Method in class org.apache.mahout.utils.vectors.TermEntry
 
getTermDictionary() - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getTermEntry(String, String) - Method in class org.apache.mahout.utils.vectors.lucene.CachedTermInfo
 
getTermEntry(String, String) - Method in interface org.apache.mahout.utils.vectors.TermInfo
 
getTermIdx() - Method in class org.apache.mahout.utils.vectors.TermEntry
 
getTopFeatures(Vector, String[], int) - Static method in class org.apache.mahout.utils.clustering.ClusterDumper
 
getTypeMap() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
The map of types encountered
getValue(String, int) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getValue(String, int) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
Convert a piece of String data at a specific spot into a value
getVector() - Method in class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
getVector() - Method in class org.apache.mahout.utils.vectors.lucene.VectorMapper
Can be called after the TermVector has been mapped
getWordCount() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
The count of the number of words seen
getWordCount() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
The count of the number of words seen
getWords() - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
getWords() - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
Immutable map of words to the long id used for those words
getWriter() - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
getWriter() - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 

I

increment() - Method in class org.apache.mahout.utils.Bump125
 
incrementalCreateBenchmark() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
InMemoryFactorizationEvaluator - Class in org.apache.mahout.utils.eval
Measures the root-mean-squared error of a ratring matrix factorization against a test set.
InMemoryFactorizationEvaluator() - Constructor for class org.apache.mahout.utils.eval.InMemoryFactorizationEvaluator
 
InputDriver - Class in org.apache.mahout.clustering.conversion
This class converts text files containing space-delimited floating point numbers into Mahout sequence files of VectorWritable suitable for input to the clustering jobs in particular, and any Mahout job requiring this input in general.
InputDriver - Class in org.apache.mahout.clustering.conversion.meanshift
This class converts text files containing space-delimited floating point numbers into Mahout sequence files of MeanShiftCanopy suitable for input to the MeanShift clustering job.
InputMapper - Class in org.apache.mahout.clustering.conversion
 
InputMapper() - Constructor for class org.apache.mahout.clustering.conversion.InputMapper
 
InputMapper - Class in org.apache.mahout.clustering.conversion.meanshift
 
InputMapper() - Constructor for class org.apache.mahout.clustering.conversion.meanshift.InputMapper
 
interClusterDensity() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
This function evaluates the average density of points in the regions between clusters (eqn 1).
interClusterDensity() - Method in class org.apache.mahout.clustering.evaluation.ClusterEvaluator
Computes the inter-cluster density as defined in "Mahout In Action"
intraClusterDensity() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
The average density within clusters is defined as the percentage of representative points that reside in the neighborhood of the clusters' centers.
intraClusterDensity() - Method in class org.apache.mahout.clustering.evaluation.ClusterEvaluator
Computes the intra-cluster density as the average distance of the representative points from each other
isIgnoringOffsets() - Method in class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
isIgnoringPositions() - Method in class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
iterator() - Method in class org.apache.mahout.utils.vectors.arff.ARFFVectorIterable
 
iterator() - Method in class org.apache.mahout.utils.vectors.lucene.LuceneIterable
 

K

KEY_COLUMN_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromCsvFilter
 
KEY_PREFIX_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromDirectory
 

L

LDAPrintTopics - Class in org.apache.mahout.clustering.lda
Class to print out the top K words for each topic.
loadTermDictionary(File) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
Read in a dictionary file.
loadTermDictionary(Configuration, String) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
Read a dictionary in SequenceFile generated by DictionaryVectorizer
LuceneIterable - Class in org.apache.mahout.utils.vectors.lucene
Iterable counterpart to LuceneIterator.
LuceneIterable(IndexReader, String, String, VectorMapper) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterable
 
LuceneIterable(IndexReader, String, String, VectorMapper, double) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterable
 
LuceneIterable(IndexReader, String, String, VectorMapper, double, double) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterable
Produce a LuceneIterable that can create the Vector plus normalize it.
LuceneIterator - Class in org.apache.mahout.utils.vectors.lucene
An Iterator over Vectors that uses a Lucene index as the source for creating the Vectors.
LuceneIterator(IndexReader, String, String, VectorMapper, double) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterator
Produce a LuceneIterable that can create the Vector plus normalize it.
LuceneIterator(IndexReader, String, String, VectorMapper, double, double) - Constructor for class org.apache.mahout.utils.vectors.lucene.LuceneIterator
 

M

MailArchivesClusteringAnalyzer - Class in org.apache.mahout.text
Custom Lucene Analyzer designed for aggressive feature reduction for clustering the ASF Mail Archives using an extended set of stop words, excluding non-alpha-numeric tokens, and porter stemming.
MailArchivesClusteringAnalyzer() - Constructor for class org.apache.mahout.text.MailArchivesClusteringAnalyzer
 
MailArchivesClusteringAnalyzer(CharArraySet) - Constructor for class org.apache.mahout.text.MailArchivesClusteringAnalyzer
 
main(String[]) - Static method in class org.apache.mahout.benchmark.VectorBenchmarks
 
main(String[]) - Static method in class org.apache.mahout.clustering.conversion.InputDriver
 
main(String[]) - Static method in class org.apache.mahout.clustering.conversion.meanshift.InputDriver
 
main(String[]) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
main(String[]) - Static method in class org.apache.mahout.clustering.lda.LDAPrintTopics
 
main(String[]) - Static method in class org.apache.mahout.text.SequenceFilesFromCsvFilter
 
main(String[]) - Static method in class org.apache.mahout.text.SequenceFilesFromDirectory
 
main(String[]) - Static method in class org.apache.mahout.text.SequenceFilesFromMailArchives
 
main(String[]) - Static method in class org.apache.mahout.text.TextParagraphSplittingJob
 
main(String[]) - Static method in class org.apache.mahout.utils.clustering.ClusterDumper
 
main(String[]) - Static method in class org.apache.mahout.utils.eval.DatasetSplitter
 
main(String[]) - Static method in class org.apache.mahout.utils.eval.InMemoryFactorizationEvaluator
 
main(String[]) - Static method in class org.apache.mahout.utils.eval.ParallelFactorizationEvaluator
 
main(String[]) - Static method in class org.apache.mahout.utils.SequenceFileDumper
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.arff.Driver
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.lucene.Driver
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.RowIdJob
 
main(String[]) - Static method in class org.apache.mahout.utils.vectors.VectorDumper
 
map(LongWritable, Text, Mapper<LongWritable, Text, Text, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.conversion.InputMapper
 
map(LongWritable, Text, Mapper<LongWritable, Text, Text, MeanShiftCanopy>.Context) - Method in class org.apache.mahout.clustering.conversion.meanshift.InputMapper
 
map(IntWritable, WeightedVectorWritable, Mapper<IntWritable, WeightedVectorWritable, IntWritable, WeightedVectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
map(Text, Text, Mapper<Text, Text, Text, Text>.Context) - Method in class org.apache.mahout.text.TextParagraphSplittingJob.SplitMap
 
map(LongWritable, Text, Mapper<LongWritable, Text, IntPairWritable, DoubleWritable>.Context) - Method in class org.apache.mahout.utils.eval.ParallelFactorizationEvaluator.PairsWithRatingMapper
 
map(String, int, TermVectorOffsetInfo[], int[]) - Method in class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
MapBackedARFFModel - Class in org.apache.mahout.utils.vectors.arff
Holds ARFF information in Map.
MapBackedARFFModel() - Constructor for class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
MapBackedARFFModel(Map<String, Long>, long, Map<String, Map<String, Integer>>) - Constructor for class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
mapPoint(IntWritable, WeightedVectorWritable, DistanceMeasure, Map<Integer, List<VectorWritable>>, Map<Integer, WeightedVectorWritable>) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 

N

NO_NORMALIZING - Static variable in class org.apache.mahout.utils.vectors.lucene.LuceneIterable
 
NUM_WORDS_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 

O

options - Variable in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
org.apache.mahout.benchmark - package org.apache.mahout.benchmark
 
org.apache.mahout.clustering.cdbw - package org.apache.mahout.clustering.cdbw
 
org.apache.mahout.clustering.conversion - package org.apache.mahout.clustering.conversion
 
org.apache.mahout.clustering.conversion.meanshift - package org.apache.mahout.clustering.conversion.meanshift
 
org.apache.mahout.clustering.evaluation - package org.apache.mahout.clustering.evaluation
 
org.apache.mahout.clustering.lda - package org.apache.mahout.clustering.lda
 
org.apache.mahout.text - package org.apache.mahout.text
 
org.apache.mahout.utils - package org.apache.mahout.utils
 
org.apache.mahout.utils.clustering - package org.apache.mahout.utils.clustering
 
org.apache.mahout.utils.eval - package org.apache.mahout.utils.eval
 
org.apache.mahout.utils.vectors - package org.apache.mahout.utils.vectors
 
org.apache.mahout.utils.vectors.arff - package org.apache.mahout.utils.vectors.arff
 
org.apache.mahout.utils.vectors.csv - package org.apache.mahout.utils.vectors.csv
 
org.apache.mahout.utils.vectors.io - package org.apache.mahout.utils.vectors.io
 
org.apache.mahout.utils.vectors.lucene - package org.apache.mahout.utils.vectors.lucene
 
OUTPUT_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 

P

ParallelFactorizationEvaluator - Class in org.apache.mahout.utils.eval
Measures the root-mean-squared error of a ratring matrix factorization against a test set.
ParallelFactorizationEvaluator() - Constructor for class org.apache.mahout.utils.eval.ParallelFactorizationEvaluator
 
ParallelFactorizationEvaluator.ErrorReducer - Class in org.apache.mahout.utils.eval
 
ParallelFactorizationEvaluator.ErrorReducer() - Constructor for class org.apache.mahout.utils.eval.ParallelFactorizationEvaluator.ErrorReducer
 
ParallelFactorizationEvaluator.PairsWithRatingMapper - Class in org.apache.mahout.utils.eval
 
ParallelFactorizationEvaluator.PairsWithRatingMapper() - Constructor for class org.apache.mahout.utils.eval.ParallelFactorizationEvaluator.PairsWithRatingMapper
 
parseOptions() - Method in class org.apache.mahout.text.SequenceFilesFromCsvFilter
 
parseOptions() - Method in class org.apache.mahout.text.SequenceFilesFromDirectory
Override this method in order to parse your additional options from the command line.
POINTS_DIR_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
prefix - Variable in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
PrefixAdditionFilter - Class in org.apache.mahout.text
Default parser for parsing text into sequence files.
PrefixAdditionFilter(Configuration, String, Map<String, String>, ChunkedWriter) - Constructor for class org.apache.mahout.text.PrefixAdditionFilter
 
printClusters(String[]) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
process(FileStatus, Path) - Method in class org.apache.mahout.text.PrefixAdditionFilter
 
process(FileStatus, Path) - Method in class org.apache.mahout.text.SequenceFilesFromCsvFilter
 
process(FileStatus, Path) - Method in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
processDate(String, int) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
processNominal(String, String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
processNumeric(String) - Static method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
processString(String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 

R

readPoints(Path, Configuration) - Static method in class org.apache.mahout.utils.clustering.ClusterDumper
 
reduce(IntWritable, Iterable<WeightedVectorWritable>, Reducer<IntWritable, WeightedVectorWritable, IntWritable, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
reduce(IntPairWritable, Iterable<DoubleWritable>, Reducer<IntPairWritable, DoubleWritable, DoubleWritable, NullWritable>.Context) - Method in class org.apache.mahout.utils.eval.ParallelFactorizationEvaluator.ErrorReducer
 
RELATION - Static variable in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
RepresentativePointsDriver - Class in org.apache.mahout.clustering.evaluation
 
RepresentativePointsMapper - Class in org.apache.mahout.clustering.evaluation
 
RepresentativePointsMapper() - Constructor for class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
RepresentativePointsReducer - Class in org.apache.mahout.clustering.evaluation
 
RepresentativePointsReducer() - Constructor for class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
RowIdJob - Class in org.apache.mahout.utils.vectors
 
RowIdJob() - Constructor for class org.apache.mahout.utils.vectors.RowIdJob
 
run(String[]) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
run(Configuration, Path, Path, Path, DistanceMeasure, int, boolean) - Static method in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
run(Configuration, String, Map<String, String>, Path, Path) - Method in class org.apache.mahout.text.SequenceFilesFromDirectory
 
run(String[]) - Method in class org.apache.mahout.text.SequenceFilesFromDirectory
 
run(String[]) - Method in class org.apache.mahout.text.TextParagraphSplittingJob
 
run(String[]) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
run(String[]) - Method in class org.apache.mahout.utils.eval.DatasetSplitter
 
run(String[]) - Method in class org.apache.mahout.utils.eval.InMemoryFactorizationEvaluator
 
run(String[]) - Method in class org.apache.mahout.utils.eval.ParallelFactorizationEvaluator
 
run(String[]) - Method in class org.apache.mahout.utils.vectors.RowIdJob
 
runJob(Path, Path, String) - Static method in class org.apache.mahout.clustering.conversion.InputDriver
 
runJob(Path, Path) - Static method in class org.apache.mahout.clustering.conversion.meanshift.InputDriver
 

S

separation() - Method in class org.apache.mahout.clustering.cdbw.CDbwEvaluator
Calculate the separation of clusters (eqn 4) taking into account both the distances between the clusters' closest points and the Inter-cluster density.
SEQ_FILE_DIR_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 
SequenceFileDumper - Class in org.apache.mahout.utils
 
SequenceFilesFromCsvFilter - Class in org.apache.mahout.text
Implements an example csv to sequence file parser.
SequenceFilesFromCsvFilter(Configuration, String, Map<String, String>, ChunkedWriter) - Constructor for class org.apache.mahout.text.SequenceFilesFromCsvFilter
 
SequenceFilesFromDirectory - Class in org.apache.mahout.text
Converts a directory of text documents into SequenceFiles of Specified chunkSize.
SequenceFilesFromDirectory() - Constructor for class org.apache.mahout.text.SequenceFilesFromDirectory
 
SequenceFilesFromDirectoryFilter - Class in org.apache.mahout.text
Implement this interface if you wish to extend SequenceFilesFromDirectory with your own parsing logic.
SequenceFilesFromDirectoryFilter() - Constructor for class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
SequenceFilesFromDirectoryFilter(Configuration, String, Map<String, String>, ChunkedWriter) - Constructor for class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 
SequenceFilesFromMailArchives - Class in org.apache.mahout.text
Converts a directory of gzipped mail archives into SequenceFiles of specified chunkSize.
SequenceFilesFromMailArchives() - Constructor for class org.apache.mahout.text.SequenceFilesFromMailArchives
 
SequenceFilesFromMailArchives.ChunkedWriter - Class in org.apache.mahout.text
 
SequenceFilesFromMailArchives.ChunkedWriter(int, String) - Constructor for class org.apache.mahout.text.SequenceFilesFromMailArchives.ChunkedWriter
 
SequenceFilesFromMailArchives.PrefixAdditionFilter - Class in org.apache.mahout.text
 
SequenceFilesFromMailArchives.PrefixAdditionFilter(String, SequenceFilesFromMailArchives.ChunkedWriter, Charset) - Constructor for class org.apache.mahout.text.SequenceFilesFromMailArchives.PrefixAdditionFilter
 
SequenceFileVectorWriter - Class in org.apache.mahout.utils.vectors.io
Writes out Vectors to a SequenceFile.
SequenceFileVectorWriter(SequenceFile.Writer) - Constructor for class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
serializeBenchmark() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
setExpectations(String, int, boolean, boolean) - Method in class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
setIdField(String) - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
setNumTopFeatures(int) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
setOutput(String) - Method in class org.apache.mahout.utils.vectors.lucene.ClusterLabels
 
setOutputFile(String) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
setRelation(String) - Method in interface org.apache.mahout.utils.vectors.arff.ARFFModel
 
setRelation(String) - Method in class org.apache.mahout.utils.vectors.arff.MapBackedARFFModel
 
setSubString(int) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
setTermDictionary(String, String) - Method in class org.apache.mahout.utils.clustering.ClusterDumper
 
setup(Mapper<LongWritable, Text, Text, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.conversion.InputMapper
 
setup(Mapper<IntWritable, WeightedVectorWritable, IntWritable, WeightedVectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsMapper
 
setup(Reducer<IntWritable, WeightedVectorWritable, IntWritable, VectorWritable>.Context) - Method in class org.apache.mahout.clustering.evaluation.RepresentativePointsReducer
 
STATE_IN_KEY - Static variable in class org.apache.mahout.clustering.evaluation.RepresentativePointsDriver
 
SUBSTRING_OPTION - Static variable in class org.apache.mahout.utils.clustering.ClusterDumper
 

T

TermEntry - Class in org.apache.mahout.utils.vectors
 
TermEntry(String, int, int) - Constructor for class org.apache.mahout.utils.vectors.TermEntry
 
TermInfo - Interface in org.apache.mahout.utils.vectors
 
TermInfoWriter - Interface in org.apache.mahout.utils.vectors.io
 
TextParagraphSplittingJob - Class in org.apache.mahout.text
 
TextParagraphSplittingJob() - Constructor for class org.apache.mahout.text.TextParagraphSplittingJob
 
TextParagraphSplittingJob.SplitMap - Class in org.apache.mahout.text
 
TextParagraphSplittingJob.SplitMap() - Constructor for class org.apache.mahout.text.TextParagraphSplittingJob.SplitMap
 
TextualVectorWriter - Class in org.apache.mahout.utils.vectors.io
Write out the vectors to any Writer using Vector.asFormatString(), one per line by default.
TextualVectorWriter(Writer) - Constructor for class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
TFDFMapper - Class in org.apache.mahout.utils.vectors.lucene
Not thread-safe
TFDFMapper(IndexReader, Weight, TermInfo) - Constructor for class org.apache.mahout.utils.vectors.lucene.TFDFMapper
 
tokenStream(String, Reader) - Method in class org.apache.mahout.text.MailArchivesClusteringAnalyzer
 
toString() - Method in class org.apache.mahout.benchmark.VectorBenchmarks
 
totalTerms(String) - Method in class org.apache.mahout.utils.vectors.lucene.CachedTermInfo
 
totalTerms(String) - Method in interface org.apache.mahout.utils.vectors.TermInfo
 

V

VALUE_COLUMN_OPTION - Static variable in class org.apache.mahout.text.SequenceFilesFromCsvFilter
 
valueOf(String) - Static method in enum org.apache.mahout.utils.vectors.arff.ARFFType
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.apache.mahout.utils.vectors.arff.ARFFType
Returns an array containing the constants of this enum type, in the order they are declared.
VectorBenchmarks - Class in org.apache.mahout.benchmark
 
VectorBenchmarks(int, int, int, int, int, int) - Constructor for class org.apache.mahout.benchmark.VectorBenchmarks
 
VectorDumper - Class in org.apache.mahout.utils.vectors
Can read in a SequenceFile of Vectors and dump out the results using Vector.asFormatString() to either the console or to a file.
VectorHelper - Class in org.apache.mahout.utils.vectors
 
VectorMapper - Class in org.apache.mahout.utils.vectors.lucene
Not thread-safe
VectorMapper() - Constructor for class org.apache.mahout.utils.vectors.lucene.VectorMapper
 
vectorToCSVString(Vector, boolean) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
vectorToCSVString(Vector, boolean, Appendable) - Static method in class org.apache.mahout.utils.vectors.VectorHelper
 
VectorWriter - Interface in org.apache.mahout.utils.vectors.io
 

W

write(String, String) - Method in class org.apache.mahout.text.ChunkedWriter
 
write(String, String) - Method in class org.apache.mahout.text.SequenceFilesFromMailArchives.ChunkedWriter
 
write(TermInfo) - Method in class org.apache.mahout.utils.vectors.io.DelimitedTermInfoWriter
 
write(Iterable<Vector>, long) - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
write(Vector) - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
write(Iterable<Vector>) - Method in class org.apache.mahout.utils.vectors.io.SequenceFileVectorWriter
 
write(TermInfo) - Method in interface org.apache.mahout.utils.vectors.io.TermInfoWriter
 
write(Iterable<Vector>) - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
write(Iterable<Vector>, long) - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
write(Vector) - Method in class org.apache.mahout.utils.vectors.io.TextualVectorWriter
 
write(Iterable<Vector>) - Method in interface org.apache.mahout.utils.vectors.io.VectorWriter
Write all values in the Iterable to the output
write(Vector) - Method in interface org.apache.mahout.utils.vectors.io.VectorWriter
Write out a vector
write(Iterable<Vector>, long) - Method in interface org.apache.mahout.utils.vectors.io.VectorWriter
Write the first maxDocs to the output.
writer - Variable in class org.apache.mahout.text.SequenceFilesFromDirectoryFilter
 

A B C D F G I K L M N O P R S T V W

Copyright © 2008-2011 The Apache Software Foundation. All Rights Reserved.