github chenyme/grok2api v2.0.0.rc0

latest releases: v2.0.4.rc0, v2.0.3, v2.0.2...
13 hours ago

What's Changed

This release is a substantial upgrade across API compatibility, runtime architecture, account orchestration, frontend experience, and deployment flow.
If you are upgrading from an older version, please back up your existing data and configuration first. This release includes several structural changes and is not fully backward-compatible.

Highlights

  • Added Anthropic-compatible Messages API support at POST /v1/messages
  • Added unified tool-call parsing for XML- and JSON-style tool formats
  • Added tokenizer-backed usage estimation with tiktoken
  • Rebuilt the WebUI and Admin surface on top of a unified static asset structure
  • Refactored the backend into platform, control, dataplane, and products
  • Improved account refresh, quota sync, multi-worker scheduling, and proxy handling
  • Tightened authentication boundaries on OpenAI-compatible endpoints
  • Added automatic version-based cache busting for local static assets

API Expansion

Supported and improved API surfaces now include:

  • OpenAI-compatible endpoints:
    • /v1/models
    • /v1/chat/completions
    • /v1/responses
    • /v1/images/generations
    • /v1/images/edits
    • /v1/videos
  • Anthropic-compatible endpoint:
    • /v1/messages
  • Capability coverage across:
    • streaming and non-streaming responses
    • reasoning / thinking output
    • tool-call passthrough and structured parsing
    • image generation
    • image editing
    • video generation
    • locally cached media access

Model registration and exposure were also centralized, making /v1/models cleaner and easier to extend.

Anthropic Compatibility

One of the most important additions in this release is Anthropic-style Messages API support.

This includes:

  • new POST /v1/messages routing
  • SSE streaming support
  • tool-enabled message flows
  • thinking-mode handling
  • compatibility-layer validation and response shaping

This moves the project beyond a single compatibility surface and into broader multi-API support.

Tool Calling and Usage Accounting

Tool calling and usage estimation both received meaningful upgrades.

Included in this work:

  • a unified tool-call parser
  • structured parsing from XML, JSON envelopes, and JSON arrays
  • shared tool prompting / parsing flow
  • tiktoken-based prompt, message, and tool-call estimation
  • more consistent usage reporting across OpenAI-compatible and Anthropic-compatible APIs

For clients that rely on tool invocation behavior and stable usage metadata, this release should be noticeably more reliable.

WebUI and Admin Rebuild

This release is not just backend work. The built-in frontend has also been significantly refreshed.

Included in this update:

  • rebuilt Admin pages for:
    • login
    • account management
    • config management
    • cache management
  • rebuilt WebUI pages for:
    • Web Chat
    • Masonry
    • ChatKit
  • unified static assets under app/statics
  • improved page asset versioning to reduce stale-cache issues
  • added a lightweight global footer and multiple UX refinements
  • moved WebUI Chat model selection closer to the send action for a better interaction flow

Runtime and Architecture Refactor

A large portion of this release is architectural work, but it directly improves maintainability and runtime clarity.

Key changes include:

  • entrypoint moved from main.py to app.main
  • backend reorganized into:
    • platform
    • control
    • dataplane
    • products
  • clearer separation across:
    • config
    • auth
    • logging
    • storage
    • model registry
    • reverse protocol / transport handling
    • product-facing routing

Runtime behavior was also improved for production-style deployments:

  • lightweight account-directory sync across workers
  • file-lock-based scheduler leader election
  • more explicit account refresh and quota sync flows
  • better logging and clearer runtime state handling

Account and Proxy Improvements

Account orchestration and proxy handling were both significantly improved.

Notable changes:

  • clearer separation for local / Redis / SQL account backends
  • stronger account refresh behavior
  • improved invalid-credential handling
  • better quota defaults and state-machine behavior
  • more robust multi-account pool orchestration
  • improved manual clearance and FlareSolverr integration
  • better proxy feedback, session reset, and scheduling logic

These changes matter directly for long-running stability and heavier traffic.

Deployment and Operations

Deployment ergonomics were also improved.

This release includes:

  • a new .env.example
  • clearer config layering across .env, data/config.toml, and config.defaults.toml
  • improved Docker build flow
  • container health checks
  • version and update metadata endpoints
  • refreshed Chinese and English documentation

Tests

This release adds more live coverage for key protocol paths, including:

  • tool calls
  • responses tool calls
  • Anthropic messages flows

Breaking Changes

This release includes multiple changes that are not backward-compatible with older versions. Please back up your data and configuration before upgrading.

Please review carefully:

  • API key verification is now enforced more consistently on model- and response-related OpenAI-compatible endpoints
  • WebUI config keys were renamed:
    • function_enabled -> webui_enabled
    • function_key -> webui_key
  • legacy static asset layouts under _public and intermediate app/assets were replaced by the new app/statics structure
  • some older migration scripts and legacy service paths were removed
  • deployment entrypoint should now use app.main:app

Upgrade Notes

Before upgrading, it is strongly recommended to:

  • back up the entire data/ directory
  • back up your current config files and deployment environment variables
  • check whether you still use old function_* configuration keys
  • update deployment scripts that still target the old entrypoint
  • review any custom integrations that depend on previous static asset paths or internal module paths
  • monitor logs carefully on first startup in multi-worker deployments to verify scheduler and account-sync behavior

This release should be treated as a migration-aware upgrade rather than a drop-in replacement.

Full Changelog: v1.6.0...v2.0.0.rc0

Don't miss a new grok2api release

NewReleases is sending notifications on new releases.