JIRA Report

| Key | Summary | Status | Resolution | By |
| --- | --- | --- | --- | --- |
| OPENNLP-510 | Maven dependency on jwnl is broken | Closed | Fixed | William Colen |
| OPENNLP-568 | Doccat command line tagger should assume whitespace tokenized input | Closed | Fixed | Joern Kottmann |
| OPENNLP-564 | DeTokenizer Rule File for german language | Closed | Fixed | Joern Kottmann |
| OPENNLP-562 | invoking .find() on a RegexNameFinder instance brings back Spans with identical start/end indices | Closed | Fixed | James Kosin |
| OPENNLP-559 | Enable UIMA Parser module | Closed | Fixed | Joern Kottmann |
| OPENNLP-553 | Quasinewton trainer test should not write model to current working directory | Closed | Fixed | Joern Kottmann |
| OPENNLP-549 | Inconsistent handling of lower-/upper- case POS tags in the JWNLDictionary.getLemmas method | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-530 | AD NameFinder formatter is not working correctly with contractions | Closed | Fixed | William Colen |
| OPENNLP-529 | AD formatter is not working with Amazonia corpus | Closed | Fixed | William Colen |
| OPENNLP-527 | No way to close FileEventStreams | Closed | Fixed | Joern Kottmann |
| OPENNLP-524 | Tokenizer does not load 1.5.0 sourceforge model | Closed | Fixed | William Colen |
| OPENNLP-520 | Artifacts of legacy models are not correctly validated by tools that implements the factory mechanism | Closed | Fixed | William Colen |
| OPENNLP-508 | Add an option to create or expand a TagDictionary with training data | Closed | Fixed | William Colen |
| OPENNLP-502 | Doccat trainer should use default feature generator if non is provided | Closed | Fixed | Joern Kottmann |
| OPENNLP-500 | Improve OSGi support for OpenNLP extensions | Closed | Fixed | Joern Kottmann |
| OPENNLP-499 | Span Comparable implementation should be consistent with equals | Closed | Fixed | William Colen |
| OPENNLP-495 | DictionaryNameFinder only outputs Spans of type "default" | Closed | Fixed | Joern Kottmann |
| OPENNLP-486 | Corferencer should be integrated into the command line interface | Closed | Fixed | Joern Kottmann |
| OPENNLP-485 | Improve the AD NameSample formater | Closed | Fixed | William Colen |
| OPENNLP-484 | Improve features related to abbreviation dictionary in Tokenizer | Closed | Fixed | William Colen |
| OPENNLP-483 | Refactor the DefaultTokenContextGenerator to make it easier to create a sub-class | Closed | Fixed | William Colen |
| OPENNLP-482 | Create a Factory to customize the Tokenizer | Closed | Fixed | William Colen |
| OPENNLP-481 | ADTokenSampleStream is not handling hyphenated tokens correctly | Closed | Fixed | William Colen |
| OPENNLP-479 | Features related to abbreviation dictionary are not properly collected by DefaultSDContextGenerator | Closed | Fixed | William Colen |
| OPENNLP-478 | NameSample should create typed span | Closed | Fixed | William Colen |
| OPENNLP-477 | DictionaryNameFinder evaluation always returns 0, 0, -1 | Closed | Fixed | William Colen |
| OPENNLP-471 | DictionaryNameFinder has HASHing issues | Closed | Fixed | James Kosin |
| OPENNLP-470 | CLI for SimpleTokenizer is broken | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-452 | Running the POSTaggerCrossValidator with -ngram argument causes an exception | Closed | Fixed | William Colen |
| OPENNLP-450 | Add additional context support to POS Tagger | Closed | Fixed | William Colen |
| OPENNLP-445 | OpenNLP TLP: Update source location in POM | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-439 | OpenNLP TLP: Remove Incubator disclaimer | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-437 | OpenNLP TLP: Update source code section to new subversion location | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-426 | Parse.insert method should not throw an InternalError | Closed | Fixed | Joern Kottmann |
| OPENNLP-418 | CrossValidator tools argument parser not working with minimun arguments | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-417 | Back-to-Back <START><END> tags get improperly set when tagging | Closed | Fixed | James Kosin |
| OPENNLP-415 | NameFinder Dictionary Search Not Working Correctly | Closed | Fixed | James Kosin |
| OPENNLP-403 | Token feature generation is not working when using feature generator descriptor | Closed | Fixed | Joern Kottmann |
| OPENNLP-402 | CLI tools and formats refactored | Closed | Fixed | Joern Kottmann |
| OPENNLP-396 | Name Finder evaluator does not work with old data format | Closed | Fixed | Joern Kottmann |
| OPENNLP-395 | UIMA Name Finder trainer does not set clear adaptive data flag correctly | Closed | Fixed | Joern Kottmann |
| OPENNLP-394 | Name Finder cross validation should be done on a document level | Closed | Fixed | Joern Kottmann |
| OPENNLP-381 | Error messages for command line arguments introduced into command line tools | Closed | Fixed | Joern Kottmann |
| OPENNLP-367 | File Encoding Issues | Closed | Fixed | James Kosin |
| OPENNLP-361 | handling spaces in JAVA_HOME | Closed | Fixed | James Kosin |
| OPENNLP-338 | Add L-BFGS parameter estimation training to maxent | Closed | Fixed | Hyosup Shim |
| OPENNLP-337 | Move the Porter Stemmer to OpenNLP Tools | Closed | Fixed | Boris Galitsky |
| OPENNLP-218 | Add a chunker training api section | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-215 | Add Tokenizer Training API section | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-46 | Write documentation for the CONLL02 converter | Closed | Fixed | Joern Kottmann |
| OPENNLP-554 | Java Update 7_10 breaks -jar with wildcard support | Closed | Fixed | James Kosin |
| OPENNLP-541 | Improve ADChunkSampleStream | Closed | Fixed | William Colen |
| OPENNLP-539 | Create a factory to customize the Chunker | Closed | Fixed | William Colen |
| OPENNLP-536 | Sentence Detector trainer should create a trace file during training | Closed | Fixed | Joern Kottmann |
| OPENNLP-535 | Tokenizer trainer should create a trace file during training | Closed | Fixed | |
| OPENNLP-534 | Parser training no longer has option to learn/generate function tags | Closed | Fixed | Joern Kottmann |
| OPENNLP-526 | Exception cleanup in opennlp-maxent/uima | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-525 | Exception cleanup in opennlp-tools | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-523 | Add a French detokenizer dictionary | Closed | Fixed | |
| OPENNLP-522 | Improve exceptions in BaseModel | Closed | Fixed | Joern Kottmann |
| OPENNLP-521 | Skip POSTag dictionary validation during runtime | Closed | Fixed | William Colen |
| OPENNLP-519 | Language Detector sample descriptor does not specify a resource manager configuration | Closed | Fixed | Joern Kottmann |
| OPENNLP-517 | Sentence Detector Trainer Analysis Engine should support custom end-of-sentence chars | Closed | Fixed | Joern Kottmann |
| OPENNLP-507 | Parse.toString should output the default format and not just the covered text | Closed | Fixed | Joern Kottmann |
| OPENNLP-505 | Models should also have constructors which accept URL and File objects | Closed | Fixed | Joern Kottmann |
| OPENNLP-493 | TokenNameFinderTrainer should have an option to only use certain name types for training | Closed | Fixed | Joern Kottmann |
| OPENNLP-492 | Method getTokensOrderedByFrequency in POSTaggerFineGrainedReportListener probably has a typo. | Closed | Fixed | William Colen |
| OPENNLP-491 | Chunking parser throws NPE when incomplete parse has zero tokens | Closed | Fixed | Joern Kottmann |
| OPENNLP-490 | Add MERGE_BOTH option to Detokenizer | Closed | Fixed | William Colen |
| OPENNLP-474 | Name Finders cross validation is broken | Closed | Fixed | Joern Kottmann |
| OPENNLP-473 | Implement a directory training file reader stream | Closed | Fixed | Joern Kottmann |
| OPENNLP-463 | Docs incosistent with latest code | Closed | Fixed | William Colen |
| OPENNLP-449 | Implement a fine-grained evaluation report for POS Tagger | Closed | Fixed | William Colen |
| OPENNLP-448 | Add Arvores Deitadas TokenSampleStream | Closed | Fixed | William Colen |
| OPENNLP-435 | Support loading custom format factory classes in CLI | Closed | Fixed | Joern Kottmann |
| OPENNLP-434 | Create a Factory to customize the Sentence Detector | Closed | Fixed | William Colen |
| OPENNLP-432 | POSModel validation should inform the invalid POS tags of the POSDictionary | Closed | Fixed | William Colen |
| OPENNLP-431 | Debugging is slow if using a POSDictionary | Closed | Fixed | William Colen |
| OPENNLP-430 | It is missing a way to set case sensitivity while creating a POSDictionary | Closed | Fixed | William Colen |
| OPENNLP-429 | Create a Factory to customize the POS Tagger | Closed | Fixed | William Colen |
| OPENNLP-428 | make EOS character set configurable | Closed | Fixed | William Colen |
| OPENNLP-427 | UIMA Parser integration fails if document does not has sentences or tokens | Closed | Fixed | Joern Kottmann |
| OPENNLP-425 | Create sample descriptor for Parser Analysis Engine | Closed | Fixed | Joern Kottmann |
| OPENNLP-424 | Include a formatter that creates a stream of POSSample from AD corpus | Closed | Fixed | William Colen |
| OPENNLP-423 | Improve Portuguese NameSample and ChunkSample formatters | Closed | Fixed | William Colen |
| OPENNLP-422 | Include a formater that creates a stream of SentenceSample from AD corpus | Closed | Fixed | William Colen |
| OPENNLP-407 | Restore old Name Finder tool which is needed by TreebankLinker | Closed | Fixed | Joern Kottmann |
| OPENNLP-406 | Dev version 0.0.0-SNAPSHOT should never fail model loading | Closed | Fixed | Joern Kottmann |
| OPENNLP-404 | Explain generic usage of OpenNLP in introduction | Closed | Fixed | Aliaksandr Autayeu |
| OPENNLP-400 | Add a sample feature generator description for German | Closed | Fixed | Joern Kottmann |
| OPENNLP-399 | Feature generator xml description should support definition of suffix and prefix generator | Closed | Fixed | Joern Kottmann |
| OPENNLP-398 | Add a sample trainer parameter file to the lang package | Closed | Fixed | Joern Kottmann |
| OPENNLP-386 | UIMA trainer should be able to dump the training data in the OpenNLP format to a file | Closed | Fixed | Joern Kottmann |
| OPENNLP-382 | Old way of encoding parameter processing replaced with new one | Closed | Fixed | Joern Kottmann |
| OPENNLP-376 | UIMA Name Finder trainer should support feature generation xml file | Closed | Fixed | Joern Kottmann |
| OPENNLP-375 | UIMA based trainers should support maxent properties file | Closed | Fixed | Joern Kottmann |
| OPENNLP-372 | Banner with version for CLI tool | Closed | Fixed | Joern Kottmann |
| OPENNLP-370 | Generics in GISModelWriter.compressOutcomes | Closed | Fixed | Joern Kottmann |
| OPENNLP-369 | loops improved in maxent | Closed | Fixed | Joern Kottmann |
| OPENNLP-366 | Java5: generics to avoid casts | Closed | Fixed | Joern Kottmann |
| OPENNLP-365 | Java5 nuisances: boxing\unboxing, extra casts, shorter loops | Closed | Fixed | Joern Kottmann |
| OPENNLP-364 | Use StringBuilder instead of StringBuffer | Closed | Fixed | Joern Kottmann |
| OPENNLP-362 | JavaDoc warnings | Closed | Fixed | Joern Kottmann |
| OPENNLP-318 | Build of the UIMA Integration needs to inject version number | Closed | Fixed | Joern Kottmann |
| OPENNLP-552 | Remove the legacy META-INF folder from opennlp-maxent | Closed | Fixed | |
| OPENNLP-516 | CreateModel should use PerceptronModelWriter when -perceptron option is specified | Closed | Fixed | Joern Kottmann |
| OPENNLP-512 | Create sample descriptor for doccat Language Detector AE | Closed | Fixed | Joern Kottmann |
| OPENNLP-433 | Parser should insert all nodes into CAS | Closed | Fixed | Joern Kottmann |
| OPENNLP-380 | Remove old left over assembly folders | Closed | Fixed | Joern Kottmann |
| OPENNLP-344 | Build should inject version into the RELEASE_NOTES.html file | Closed | Fixed | Joern Kottmann |