github BerriAI/litellm v1.41.23

latest releases: v1.52.2-dev1, v1.52.3, v1.52.2...
3 months ago

What's Changed

  • updates cost tracking example code in docs to resolve errors by @djliden in #4714
  • Admin UI - Stack Cache hits vs misses on UI by @ishaan-jaff in #4715
  • [Fix] /audio/transcription - don't write to the local file system by @ishaan-jaff in #4719
  • [Feat] - set max file size on /audio/transcriptions by @ishaan-jaff in #4724
  • [Feature]: Add Fireworks AI data to model_prices_and_context_window.json by @danielbichuetti in #4721
  • fix(utils.py): allow passing dynamic api base for openai-compatible endpoints (Fireworks AI, etc.) by @krrishdholakia in #4723

Full Changelog: v1.41.22...v1.41.23

Docker Run LiteLLM Proxy

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.41.23

Don't want to maintain your internal proxy? get in touch 🎉

Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

Load Test LiteLLM Proxy Results

Name Status Median Response Time (ms) Average Response Time (ms) Requests/s Failures/s Request Count Failure Count Min Response Time (ms) Max Response Time (ms)
/chat/completions Passed ✅ 130.0 154.4940451549665 6.426530640686624 0.0 1923 0 114.97331199996097 2915.000806999956
Aggregated Passed ✅ 130.0 154.4940451549665 6.426530640686624 0.0 1923 0 114.97331199996097 2915.000806999956

Don't miss a new litellm release

NewReleases is sending notifications on new releases.