- Remove preparation scripts and distribute precompiled binaries #87
- Add DualConnector, a faster and smaller dictionary format #86
Precompiled dictionary files
We provide precompiled dictionaries for Vibrato, allowing you to get started with tokenization easily. You can download them from Assets in this release.
The following three variants are distributed:
ipadic-mecab-2_7_0/system.dicfrom IPADIC v2.7.0
jumandic-mecab-7_0/system.dicfrom mecab-jumandic-utf8 v7.0
naist-jdic-mecab-0_6_3b/system.dicfrom NAIST Japanese Dictionary v0.6.3b
unidic-mecab-2_1_2/system.dicfrom UniDic v2.1.2
unidic-cwj-3_1_1/system.dicfrom UniDic v3.1.1
These system dictionaries were compiled and modified in the manners described in compile.md and map.md. We trained the mappings of connection ids using license-expired data obtained from Aozora Bunko, following the guideline.
The licenses are contained in each file.