Installation
pip install openllm==0.4.40To upgrade from a previous version, use the following command:
pip install --upgrade openllm==0.4.40Usage
All available models: openllm models
To start a LLM: python -m openllm start HuggingFaceH4/zephyr-7b-beta
To run OpenLLM within a container environment (requires GPUs): docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.40 start HuggingFaceH4/zephyr-7b-beta
Find more information about this release in the CHANGELOG.md
What's Changed
- fix(infra): conform ruff to 150 LL by @aarnphm in #781
- infra: update blame ignore to formatter hash by @aarnphm in #782
- perf: upgrade mixtral to use expert parallelism by @aarnphm in #783
Full Changelog: v0.4.39...v0.4.40