Key | Summary | Status | Resolution | By |
---|
OPENNLP-510 | Maven dependency on jwnl is broken | Closed | Fixed | William Colen |
OPENNLP-568 | Doccat command line tagger should assume whitespace tokenized input | Closed | Fixed | Joern Kottmann |
OPENNLP-564 | DeTokenizer Rule File for german language | Closed | Fixed | Joern Kottmann |
OPENNLP-562 | invoking .find() on a RegexNameFinder instance brings back Spans with identical start/end indices | Closed | Fixed | James Kosin |
OPENNLP-559 | Enable UIMA Parser module | Closed | Fixed | Joern Kottmann |
OPENNLP-553 | Quasinewton trainer test should not write model to current working directory | Closed | Fixed | Joern Kottmann |
OPENNLP-549 | Inconsistent handling of lower-/upper- case POS tags in the JWNLDictionary.getLemmas method | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-530 | AD NameFinder formatter is not working correctly with contractions | Closed | Fixed | William Colen |
OPENNLP-529 | AD formatter is not working with Amazonia corpus | Closed | Fixed | William Colen |
OPENNLP-527 | No way to close FileEventStreams | Closed | Fixed | Joern Kottmann |
OPENNLP-524 | Tokenizer does not load 1.5.0 sourceforge model | Closed | Fixed | William Colen |
OPENNLP-520 | Artifacts of legacy models are not correctly validated by tools that implements the factory mechanism | Closed | Fixed | William Colen |
OPENNLP-508 | Add an option to create or expand a TagDictionary with training data | Closed | Fixed | William Colen |
OPENNLP-502 | Doccat trainer should use default feature generator if non is provided | Closed | Fixed | Joern Kottmann |
OPENNLP-500 | Improve OSGi support for OpenNLP extensions | Closed | Fixed | Joern Kottmann |
OPENNLP-499 | Span Comparable implementation should be consistent with equals | Closed | Fixed | William Colen |
OPENNLP-495 | DictionaryNameFinder only outputs Spans of type "default" | Closed | Fixed | Joern Kottmann |
OPENNLP-486 | Corferencer should be integrated into the command line interface | Closed | Fixed | Joern Kottmann |
OPENNLP-485 | Improve the AD NameSample formater | Closed | Fixed | William Colen |
OPENNLP-484 | Improve features related to abbreviation dictionary in Tokenizer | Closed | Fixed | William Colen |
OPENNLP-483 | Refactor the DefaultTokenContextGenerator to make it easier to create a sub-class | Closed | Fixed | William Colen |
OPENNLP-482 | Create a Factory to customize the Tokenizer | Closed | Fixed | William Colen |
OPENNLP-481 | ADTokenSampleStream is not handling hyphenated tokens correctly | Closed | Fixed | William Colen |
OPENNLP-479 | Features related to abbreviation dictionary are not properly collected by DefaultSDContextGenerator | Closed | Fixed | William Colen |
OPENNLP-478 | NameSample should create typed span | Closed | Fixed | William Colen |
OPENNLP-477 | DictionaryNameFinder evaluation always returns 0, 0, -1 | Closed | Fixed | William Colen |
OPENNLP-471 | DictionaryNameFinder has HASHing issues | Closed | Fixed | James Kosin |
OPENNLP-470 | CLI for SimpleTokenizer is broken | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-452 | Running the POSTaggerCrossValidator with -ngram argument causes an exception | Closed | Fixed | William Colen |
OPENNLP-450 | Add additional context support to POS Tagger | Closed | Fixed | William Colen |
OPENNLP-445 | OpenNLP TLP: Update source location in POM | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-439 | OpenNLP TLP: Remove Incubator disclaimer | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-437 | OpenNLP TLP: Update source code section to new subversion location | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-426 | Parse.insert method should not throw an InternalError | Closed | Fixed | Joern Kottmann |
OPENNLP-418 | CrossValidator tools argument parser not working with minimun arguments | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-417 | Back-to-Back <START><END> tags get improperly set when tagging | Closed | Fixed | James Kosin |
OPENNLP-415 | NameFinder Dictionary Search Not Working Correctly | Closed | Fixed | James Kosin |
OPENNLP-403 | Token feature generation is not working when using feature generator descriptor | Closed | Fixed | Joern Kottmann |
OPENNLP-402 | CLI tools and formats refactored | Closed | Fixed | Joern Kottmann |
OPENNLP-396 | Name Finder evaluator does not work with old data format | Closed | Fixed | Joern Kottmann |
OPENNLP-395 | UIMA Name Finder trainer does not set clear adaptive data flag correctly | Closed | Fixed | Joern Kottmann |
OPENNLP-394 | Name Finder cross validation should be done on a document level | Closed | Fixed | Joern Kottmann |
OPENNLP-381 | Error messages for command line arguments introduced into command line tools | Closed | Fixed | Joern Kottmann |
OPENNLP-367 | File Encoding Issues | Closed | Fixed | James Kosin |
OPENNLP-361 | handling spaces in JAVA_HOME | Closed | Fixed | James Kosin |
OPENNLP-338 | Add L-BFGS parameter estimation training to maxent | Closed | Fixed | Hyosup Shim |
OPENNLP-337 | Move the Porter Stemmer to OpenNLP Tools | Closed | Fixed | Boris Galitsky |
OPENNLP-218 | Add a chunker training api section | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-215 | Add Tokenizer Training API section | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-46 | Write documentation for the CONLL02 converter | Closed | Fixed | Joern Kottmann |
OPENNLP-554 | Java Update 7_10 breaks -jar with wildcard support | Closed | Fixed | James Kosin |
OPENNLP-541 | Improve ADChunkSampleStream | Closed | Fixed | William Colen |
OPENNLP-539 | Create a factory to customize the Chunker | Closed | Fixed | William Colen |
OPENNLP-536 | Sentence Detector trainer should create a trace file during training | Closed | Fixed | Joern Kottmann |
OPENNLP-535 | Tokenizer trainer should create a trace file during training | Closed | Fixed | |
OPENNLP-534 | Parser training no longer has option to learn/generate function tags | Closed | Fixed | Joern Kottmann |
OPENNLP-526 | Exception cleanup in opennlp-maxent/uima | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-525 | Exception cleanup in opennlp-tools | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-523 | Add a French detokenizer dictionary | Closed | Fixed | |
OPENNLP-522 | Improve exceptions in BaseModel | Closed | Fixed | Joern Kottmann |
OPENNLP-521 | Skip POSTag dictionary validation during runtime | Closed | Fixed | William Colen |
OPENNLP-519 | Language Detector sample descriptor does not specify a resource manager configuration | Closed | Fixed | Joern Kottmann |
OPENNLP-517 | Sentence Detector Trainer Analysis Engine should support custom end-of-sentence chars | Closed | Fixed | Joern Kottmann |
OPENNLP-507 | Parse.toString should output the default format and not just the covered text | Closed | Fixed | Joern Kottmann |
OPENNLP-505 | Models should also have constructors which accept URL and File objects | Closed | Fixed | Joern Kottmann |
OPENNLP-493 | TokenNameFinderTrainer should have an option to only use certain name types for training | Closed | Fixed | Joern Kottmann |
OPENNLP-492 | Method getTokensOrderedByFrequency in POSTaggerFineGrainedReportListener probably has a typo. | Closed | Fixed | William Colen |
OPENNLP-491 | Chunking parser throws NPE when incomplete parse has zero tokens | Closed | Fixed | Joern Kottmann |
OPENNLP-490 | Add MERGE_BOTH option to Detokenizer | Closed | Fixed | William Colen |
OPENNLP-474 | Name Finders cross validation is broken | Closed | Fixed | Joern Kottmann |
OPENNLP-473 | Implement a directory training file reader stream | Closed | Fixed | Joern Kottmann |
OPENNLP-463 | Docs incosistent with latest code | Closed | Fixed | William Colen |
OPENNLP-449 | Implement a fine-grained evaluation report for POS Tagger | Closed | Fixed | William Colen |
OPENNLP-448 | Add Arvores Deitadas TokenSampleStream | Closed | Fixed | William Colen |
OPENNLP-435 | Support loading custom format factory classes in CLI | Closed | Fixed | Joern Kottmann |
OPENNLP-434 | Create a Factory to customize the Sentence Detector | Closed | Fixed | William Colen |
OPENNLP-432 | POSModel validation should inform the invalid POS tags of the POSDictionary | Closed | Fixed | William Colen |
OPENNLP-431 | Debugging is slow if using a POSDictionary | Closed | Fixed | William Colen |
OPENNLP-430 | It is missing a way to set case sensitivity while creating a POSDictionary | Closed | Fixed | William Colen |
OPENNLP-429 | Create a Factory to customize the POS Tagger | Closed | Fixed | William Colen |
OPENNLP-428 | make EOS character set configurable | Closed | Fixed | William Colen |
OPENNLP-427 | UIMA Parser integration fails if document does not has sentences or tokens | Closed | Fixed | Joern Kottmann |
OPENNLP-425 | Create sample descriptor for Parser Analysis Engine | Closed | Fixed | Joern Kottmann |
OPENNLP-424 | Include a formatter that creates a stream of POSSample from AD corpus | Closed | Fixed | William Colen |
OPENNLP-423 | Improve Portuguese NameSample and ChunkSample formatters | Closed | Fixed | William Colen |
OPENNLP-422 | Include a formater that creates a stream of SentenceSample from AD corpus | Closed | Fixed | William Colen |
OPENNLP-407 | Restore old Name Finder tool which is needed by TreebankLinker | Closed | Fixed | Joern Kottmann |
OPENNLP-406 | Dev version 0.0.0-SNAPSHOT should never fail model loading | Closed | Fixed | Joern Kottmann |
OPENNLP-404 | Explain generic usage of OpenNLP in introduction | Closed | Fixed | Aliaksandr Autayeu |
OPENNLP-400 | Add a sample feature generator description for German | Closed | Fixed | Joern Kottmann |
OPENNLP-399 | Feature generator xml description should support definition of suffix and prefix generator | Closed | Fixed | Joern Kottmann |
OPENNLP-398 | Add a sample trainer parameter file to the lang package | Closed | Fixed | Joern Kottmann |
OPENNLP-386 | UIMA trainer should be able to dump the training data in the OpenNLP format to a file | Closed | Fixed | Joern Kottmann |
OPENNLP-382 | Old way of encoding parameter processing replaced with new one | Closed | Fixed | Joern Kottmann |
OPENNLP-376 | UIMA Name Finder trainer should support feature generation xml file | Closed | Fixed | Joern Kottmann |
OPENNLP-375 | UIMA based trainers should support maxent properties file | Closed | Fixed | Joern Kottmann |
OPENNLP-372 | Banner with version for CLI tool | Closed | Fixed | Joern Kottmann |
OPENNLP-370 | Generics in GISModelWriter.compressOutcomes | Closed | Fixed | Joern Kottmann |
OPENNLP-369 | loops improved in maxent | Closed | Fixed | Joern Kottmann |
OPENNLP-366 | Java5: generics to avoid casts | Closed | Fixed | Joern Kottmann |
OPENNLP-365 | Java5 nuisances: boxing\unboxing, extra casts, shorter loops | Closed | Fixed | Joern Kottmann |
OPENNLP-364 | Use StringBuilder instead of StringBuffer | Closed | Fixed | Joern Kottmann |
OPENNLP-362 | JavaDoc warnings | Closed | Fixed | Joern Kottmann |
OPENNLP-318 | Build of the UIMA Integration needs to inject version number | Closed | Fixed | Joern Kottmann |
OPENNLP-552 | Remove the legacy META-INF folder from opennlp-maxent | Closed | Fixed | |
OPENNLP-516 | CreateModel should use PerceptronModelWriter when -perceptron option is specified | Closed | Fixed | Joern Kottmann |
OPENNLP-512 | Create sample descriptor for doccat Language Detector AE | Closed | Fixed | Joern Kottmann |
OPENNLP-433 | Parser should insert all nodes into CAS | Closed | Fixed | Joern Kottmann |
OPENNLP-380 | Remove old left over assembly folders | Closed | Fixed | Joern Kottmann |
OPENNLP-344 | Build should inject version into the RELEASE_NOTES.html file | Closed | Fixed | Joern Kottmann |