Bug Fixes
- Fix OpenAI-compatible endpoint compatibility (#171) — Send both
max_tokensandmax_completion_tokensin chat requests to support newer OpenAI models that rejectmax_tokens. Also centralizes URL construction to prevent double-pathing (e.g./v1/v1/chat/completions).
Thanks to @emkman for reporting this issue!