github BerriAI/litellm v1.41.23-stable

latest releases: v1.52.1.dev1, v1.52.1, v1.52.0.dev20...
3 months ago

What's Changed

  • updates cost tracking example code in docs to resolve errors by @djliden in #4714
  • Admin UI - Stack Cache hits vs misses on UI by @ishaan-jaff in #4715
  • [Fix] /audio/transcription - don't write to the local file system by @ishaan-jaff in #4719
  • [Feat] - set max file size on /audio/transcriptions by @ishaan-jaff in #4724
  • [Feature]: Add Fireworks AI data to model_prices_and_context_window.json by @danielbichuetti in #4721
  • fix(utils.py): allow passing dynamic api base for openai-compatible endpoints (Fireworks AI, etc.) by @krrishdholakia in #4723

Full Changelog: v1.41.22.dev4...v1.41.23-stable

Docker Run LiteLLM Proxy

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.41.23-stable

Don't want to maintain your internal proxy? get in touch 🎉

Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

Load Test LiteLLM Proxy Results

Name Status Median Response Time (ms) Average Response Time (ms) Requests/s Failures/s Request Count Failure Count Min Response Time (ms) Max Response Time (ms)
/chat/completions Passed ✅ 82 92.999348695874 6.567547145524842 0.0 1963 0 69.63735799999426 638.380882000007
Aggregated Passed ✅ 82 92.999348695874 6.567547145524842 0.0 1963 0 69.63735799999426 638.380882000007

Don't miss a new litellm release

NewReleases is sending notifications on new releases.