What's new in 0.1.3 (2023-08-09)
These are the changes in inference v0.1.3.
Enhancements
- ENH: accelerate 4-bit quantization for pytorch model by @pangyoki in #284
- ENH: remove chatglmcpp from deps by @UranusSeven in #329
- ENH: auto detect device in pytorch model by @pangyoki in #322
- ENH: Include model revision by @RayJi01 in #320
Bug fixes
- BUG: fix mps and cuda device detection for pytorch model by @pangyoki in #331
- BUG: Fix grammar mistake in examples by @Bojun-Feng in #336
- BUG: Fix log level on subprocess by @RayJi01 in #335
Documentation
- DOC: fix doc warnings by @UranusSeven in #314
- DOC: add ja_JP and update po files by @UranusSeven in #315
- DOC: custom models by @UranusSeven in #325
Others
Full Changelog: v0.1.2...v0.1.3