0.0.7 (2024-06-28)
Breaking Changes
batch_decode_with_padded_kv_cachewas removed, we encourage user to useBatchDecodeWithPagedKVCacheWrapperinstead. (#343)
Bugfix
- fix the
forward_return_lsefunction inBatchPrefillWithRaggedKVCacheclass (#337) - fix the scheduler behavior of large page size (#333)