Methods in opennlp.tools.tokenize that return TokenizerModel |
static TokenizerModel |
TokenizerME.train(String languageCode,
ObjectStream<TokenSample> samples,
boolean useAlphaNumericOptimization)
Trains a model for the TokenizerME with a default cutoff of 5 and 100 iterations. |
static TokenizerModel |
TokenizerME.train(String languageCode,
ObjectStream<TokenSample> samples,
boolean useAlphaNumericOptimization,
int cutoff,
int iterations)
Deprecated. use TokenizerME.train(String, ObjectStream, boolean, TrainingParameters)
instead and pass in a TrainingParameters object. |
static TokenizerModel |
TokenizerME.train(String languageCode,
ObjectStream<TokenSample> samples,
boolean useAlphaNumericOptimization,
TrainingParameters mlParams)
Trains a model for the TokenizerME . |
static TokenizerModel |
TokenizerME.train(String languageCode,
ObjectStream<TokenSample> samples,
Dictionary abbreviations,
boolean useAlphaNumericOptimization,
TrainingParameters mlParams)
Trains a model for the TokenizerME . |