xorbitsai/inference v0.1.1
What's new in 0.1.1 (2023-08-03)

These are the changes in inference v0.1.1.

New features

FEAT: add opt-125m pytorch model and add unit tests by @pangyoki in #263
FEAT: support falcon 40b pytorch model by @pangyoki in #278
FEAT: pytorch model embeddings by @jiayini1119 in #282
FEAT: support falcon-instruct 7b and 40b pytorch model by @jiayini1119 in #287
FEAT: support chatglm/chatglm2/chatglm2-32k pytorch model by @pangyoki in #283
FEAT: support qwen 7b by @UranusSeven in #294

Enhancements

ENH: Support Environment Variable by @RayJi01 in #285
REF: split supervisor and worker by @UranusSeven in #279

Bug fixes

BUG: fix import torch error even if user doesn't want to launch torch model by @pangyoki in #274
BUG: empty legacy model dir by @UranusSeven in #276

Tests

TST: add benchmark script by @pangyoki in #281

Documentation

DOC: Update README_ja_JP.md by @eltociear in #269
DOC: add docstring to client methods by @RayJi01 in #247

Full Changelog: v0.1.0...v0.1.1