github michaelfeil/infinity 0.0.32

latest releases: 0.0.51, 0.0.50, 0.0.49...
2 months ago

What's Changed

You can now run a model with a alias. This will help you communicating with the API.

infinity_emb --served-model-name "your_nickname"

You can now use preload models. This acts as a "run download and load into ram" test. Upon execution, all files are cached, which will speedup consecutive loads. For additonal speedups, use --no-model-warmup to skip model warmup after loading.

infinity_emb --preload-only --model--name-or-path BAAI/bge-large-en-v1.5 

PR's

New Contributors

Full Changelog: 0.0.31...0.0.32

Don't miss a new infinity release

NewReleases is sending notifications on new releases.