This release removes an experimental feature and improves error handling for rate-limited API requests.
Improvements
- Makes HTTP 429 (Too Many Requests) errors retryable when no fallback model is available, respecting the Retry-After header
Bug Fixes
- Gates 429 retry behavior behind WithRetryOnRateLimit() opt-in option to prevent unexpected retry behavior
Technical Changes
- Removes experimental feature from the codebase
- Adds optional gateway usage for LLM evaluation as a judge
- Refactors to use typed StatusError for retry metadata, with providers wrapping errors at Recv()
What's Changed
- Remove experimental feature by @dgageot in #2087
- docs: update CHANGELOG.md for v1.32.2 by @docker-read-write[bot] in #2090
- This can be retried by @dgageot in #2093
- [eval] Optionnally use the gateway for the llm as a judge by @dgageot in #2092
- fix: make HTTP 429 retryable when no fallback model, respect Retry-After header by @simon-agent-go-expert in #2096
New Contributors
- @simon-agent-go-expert made their first contribution in #2096
Full Changelog: v1.32.2...v1.32.3