-
Update German UD POS tagger to UD 2.14 data
-
Add Austrian German month names to the German tokenizer: #1454 Thank you @j3ernhard
-
Improve the constituency to dependency converter to remove quite a few validation errors. This includes adding the PTB Corrector as an earlier step when operating specifically on PTB data #1445
-
SSurgeon feature to split one word into multiple words: 13ede5a
-
Unravel recursion in SemanticGraph - 05804a3 Fixes one server crash observed in #1461
-
Package updates: update protobuf -> 3.25.5, javax -> 1.1.6 #1465 Unfortunately updating Lucene to fix all dependency security issues will require dropping Java 8 support
-
Fix the server caching of tokenizer annotators to include segmenter properties as well. Avoids the server not respecting a request for a different segmentation model. 6f6eb93