org.apache.mahout.classifier.bayes
Class CBayesAlgorithm

java.lang.Object
  extended by org.apache.mahout.classifier.bayes.CBayesAlgorithm
All Implemented Interfaces:
Algorithm

public class CBayesAlgorithm
extends Object

Class implementing the Complementary Naive Bayes Classifier Algorithm


Constructor Summary
CBayesAlgorithm()
           
 
Method Summary
 ClassifierResult classifyDocument(String[] document, Datastore datastore, String defaultCategory)
          Classify the document and return the Result
 ClassifierResult[] classifyDocument(String[] document, Datastore datastore, String defaultCategory, int numResults)
          Classify the document and return the top numResults
 double documentWeight(Datastore datastore, String label, String[] document)
          Calculate the document weight as the dot product of document vector and the corresponding weight vector of a particular class
 double featureWeight(Datastore datastore, String label, String feature)
          Get the weighted probability of the feature.
 Collection<String> getLabels(Datastore datastore)
          Returns the labels in the given Model
 void initialize(Datastore datastore)
          Initialize the data store and verifies the data in it.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

CBayesAlgorithm

public CBayesAlgorithm()
Method Detail

classifyDocument

public ClassifierResult classifyDocument(String[] document,
                                         Datastore datastore,
                                         String defaultCategory)
                                  throws InvalidDatastoreException
Description copied from interface: Algorithm
Classify the document and return the Result

Parameters:
document - The document to classify
datastore - The data store(InMemory)
defaultCategory - The default category to assign Ties are broken by comparing the category
Returns:
A Collection of ClassifierResults.
Throws:
InvalidDatastoreException

featureWeight

public double featureWeight(Datastore datastore,
                            String label,
                            String feature)
                     throws InvalidDatastoreException
Description copied from interface: Algorithm
Get the weighted probability of the feature.

Parameters:
datastore - The Datastore (InMemory)
label - The label of the feature
feature - The feature to calc. the prob. for
Returns:
The weighted probability
Throws:
InvalidDatastoreException

documentWeight

public double documentWeight(Datastore datastore,
                             String label,
                             String[] document)
Description copied from interface: Algorithm
Calculate the document weight as the dot product of document vector and the corresponding weight vector of a particular class

Parameters:
datastore - The Datastore (InMemory)
label - The label to calculate the probability of
document - The document
Returns:
The probability
See Also:
Algorithm.featureWeight(Datastore, String, String)

classifyDocument

public ClassifierResult[] classifyDocument(String[] document,
                                           Datastore datastore,
                                           String defaultCategory,
                                           int numResults)
                                    throws InvalidDatastoreException
Description copied from interface: Algorithm
Classify the document and return the top numResults

Specified by:
classifyDocument in interface Algorithm
Parameters:
document - The document to classify
datastore - The Datastore (InMemory)
defaultCategory - The default category to assign
numResults - The maximum number of results to return, ranked by score. Ties are broken by comparing the category
Returns:
A Collection of ClassifierResults.
Throws:
InvalidDatastoreException

initialize

public void initialize(Datastore datastore)
                throws InvalidDatastoreException
Description copied from interface: Algorithm
Initialize the data store and verifies the data in it.

Specified by:
initialize in interface Algorithm
Throws:
InvalidDatastoreException

getLabels

public Collection<String> getLabels(Datastore datastore)
                             throws InvalidDatastoreException
Description copied from interface: Algorithm
Returns the labels in the given Model

Specified by:
getLabels in interface Algorithm
Parameters:
datastore - The Datastore (InMemory)
Returns:
Collection of labels
Throws:
InvalidDatastoreException


Copyright © 2008-2012 The Apache Software Foundation. All Rights Reserved.