Uses already seen data (the indexed documents) to classify new documents. Currently only contains a (simplistic) Lucene based Naive Bayes classifier, a k-Nearest Neighbor classifier and a Perceptron based classifier