4.145.0
Minor Changes
Patch Changes
- #4876 7010f60 Thanks @markijbema! - Autocomplete: Show entire suggestion when first line has no word characters
- #4183 de30ffa Thanks @sebastiand-cerebras! - fix(cerebras): use conservative max_tokens and add integration header

  Conservative max_tokens:
  Cerebras rate limiter estimates token consumption using max_completion_tokens upfront rather than actual usage. When agentic tools automatically set this to the model maximum (e.g., 64K), users exhaust their quota prematurely and get rate-limited despite minimal actual token consumption.

  This fix uses a conservative default of 8K tokens instead of the model maximum. This is sufficient for most agentic tool use while preserving rate limit headroom.

  Integration header:
  Added X-Cerebras-3rd-Party-Integration: kilocode header to all Cerebras API requests for tracking and analytics.
- #4856 100462e Thanks @markijbema! - Improve autocomplete tooltip messaging when there's no balance

  When a user has a Kilo Code account with no credits, the autocomplete status bar now shows a helpful message explaining that they need to add credits to use autocomplete, rather than just showing a generic token error.
- #4793 4fff873 Thanks @mcowger! - Restore various providers to modelCache endpoint to fix outdated entries.
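
For readers who want a concrete picture of the Cerebras change (#4183), the following is a minimal sketch of the idea, not the actual Kilo Code implementation. The function name `buildCerebrasRequest`, the `CerebrasRequest` type, and the constant name are hypothetical, and reading "8K" as 8192 is an assumption. Only the header name and value and the max_completion_tokens field come from the entry itself.

```typescript
// Hypothetical sketch; names are illustrative and not taken from the Kilo Code source.
// Assumption: "8K" in the changelog entry is read here as 8192 tokens.
const CONSERVATIVE_MAX_TOKENS = 8192;

interface CerebrasRequest {
  headers: Record<string, string>;
  body: { model: string; max_completion_tokens: number };
}

function buildCerebrasRequest(
  model: string,
  modelMaxTokens: number,
  requestedMaxTokens?: number,
): CerebrasRequest {
  // Without an explicit request, fall back to the conservative default
  // instead of the model maximum, so the rate limiter's upfront estimate stays small.
  const fallback = Math.min(CONSERVATIVE_MAX_TOKENS, modelMaxTokens);
  // An explicit request is honored but never allowed to exceed the model maximum.
  const maxCompletionTokens =
    requestedMaxTokens === undefined
      ? fallback
      : Math.min(requestedMaxTokens, modelMaxTokens);

  return {
    headers: {
      "Content-Type": "application/json",
      // Integration header named in the changelog entry.
      "X-Cerebras-3rd-Party-Integration": "kilocode",
    },
    body: { model, max_completion_tokens: maxCompletionTokens },
  };
}
```

With a model whose maximum is 65536 tokens, calling `buildCerebrasRequest("example-model", 65536)` would reserve 8192 tokens against the quota rather than 65536, which is the headroom the entry describes. The model name here is a placeholder.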