What's New
🧠 Extended Thinking/Reasoning Mode Fix
Fixed extended thinking mode for Claude models accessed through Vertex AI/Antigravity backend (e.g., gemini-claude-opus-4-5-thinking).
Root causes fixed:
- CLIProxyAPI:
geminiToAntigravity()was incorrectly deletingmaxOutputTokensand resettingthinkingBudget=-1for all non-Gemini-3 models, including Claude - CLIProxyAPI:
ensureMaxTokensForThinking()only checked snake_casebudget_tokens, missing camelCasebudgetTokens - ProxyPal: OpenCode config used
budget_tokensbut AI SDK expectsbudgetTokens(camelCase) - ProxyPal: Output limit was too low (16384) for thinking models with
budgetTokens=16000
Changes:
- Added
isClaudecheck to preserve thinking config for Claude models via Antigravity - Support both
budget_tokensandbudgetTokensformats - Increased output limit to 32000 for
-thinkingsuffix models - Rebuilt CLIProxyAPI binary with fixes
This enables OpenCode and other AI tools to properly display Claude's thinking blocks when using ProxyPal with Vertex AI backend.