|
|||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
See:
Description
Class Summary | |
---|---|
KeywordAnalyzer | "Tokenizes" the entire stream as a single token. |
KeywordTokenizer | Emits the entire input as a single token. |
LetterTokenizer | A LetterTokenizer is a tokenizer that divides text at non-letters. |
LowerCaseFilter | Normalizes token text to lower case. |
LowerCaseTokenizer | LowerCaseTokenizer performs the function of LetterTokenizer and LowerCaseFilter together. |
SimpleAnalyzer | An Analyzer that filters LetterTokenizer
with LowerCaseFilter |
StopAnalyzer | Filters LetterTokenizer with LowerCaseFilter and StopFilter . |
StopFilter | Removes stop words from a token stream. |
TypeTokenFilter | Removes tokens whose types appear in a set of blocked types from a token stream. |
WhitespaceAnalyzer | An Analyzer that uses WhitespaceTokenizer . |
WhitespaceTokenizer | A WhitespaceTokenizer is a tokenizer that divides text at whitespace. |
Basic, general-purpose analysis components.
|
|||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |