Highlights
- Provides a MLX kernel to support mac devices
- Fix bugs in json schema converter
- Fix the compilation failure issue of torch kernels
- Supports float range, format in json schema
- Adds a function testing._is_single_token_bitmask for spec decoding
- Releases GIL for most of the methods
What's Changed
- Bump to v0.1.17 by @Ubospica in #271
- [Feature] LRU cache for grammar compiler by @DarkSharpness in #243
- Setup tmate by @Ubospica in #281
- [Fix] Fix test_json_schema_converter::test_non_strict and nanobind version issue by @Ubospica in #282
- [Fix] Switch release compilation flag from -Ofast to -O3 by @tboerstad in #268
- [Minor] Fix missing mark hf_token_required by @DarkSharpness in #284
- Add module xgrammar.contrib.mlxlm by @wangkuiyi in #273
- Float range support for xgrammar by @zanderjiang in #289
- Disable slow C++ test cases by @Ubospica in #290
- [Feature] Cover JSON schema string format by @Ubospica in #266
- HuggingFace
LogitsProcessor
to accept also list of compiled grammars by @lukaszkolodziejczyk in #275 - update ci for hf token by @zanderjiang in #267
- Release GIL for key methods by @Ubospica in #291
- Add MLX kernel to support Apple Silicon by @wangkuiyi in #287
- [Bugfix] Fix path replacement issue in
apply_token_bitmask_inplace_cuda.py
by @nFunctor in #274 - Refactor kernels by @Ubospica in #293
- Add util function testing._is_single_token_bitmask by @Ubospica in #294
- Bump to v0.1.18 by @Ubospica in #295
New Contributors
- @tboerstad made their first contribution in #268
- @lukaszkolodziejczyk made their first contribution in #275
- @nFunctor made their first contribution in #274
Full Changelog: v0.1.17...v0.1.18