github unslothai/unsloth v0.1.44-beta
Gemma 4 12B, New UI, MCP, Projects

4 hours ago

Hey everyone, this update focuses mainly on MCP, Projects, Canvas and the new chat UI.
We've also made many improvements across Studio. Next week we'll have an even bigger update.

To update Unsloth or install a new Unsloth Studio, you must use:

macOS, Linux, WSL:

curl -fsSL https://unsloth.ai/install.sh | sh

Windows:

irm https://unsloth.ai/install.ps1 | iex

Warning

DO NOT USE "unsloth studio update", since packaging will not get the latest updates. Use the install commands above instead.

Gemma 4 12B

Google releases Gemma 4 12B, a new model that runs locally on 8GB RAM. GGUF / Guide
Gemma 4 12B Unified supports image, audio and 256K context. Run and train the model via Unsloth Studio.

MCP

  • Give your model live tools instead of relying on memory: one click in the composer, no API keys for the built-ins
  • Built-in presets:
    • Context7: current docs & code for thousands of libraries
    • Exa: live web search
    • Hugging Face: search models, datasets & papers
  • Add your own remote (OAuth/headers) or local (stdio) servers, toggled per chat

New Chat UI

  • Projects, Canvas, MCP, and Compare tuck into one + menu
  • Search and Code are now one click away

Projects

  • Keep related chats together in one workspace
  • Create a project from the sidebar, then add new or existing chats to it

Experimental Canvas / Artifacts

  • Opens generated HTML in a dedicated canvas panel inside Unsloth Studio
  • Supports interactive outputs, including browser-based visualizations and CDN-loaded packages
  • Lets you switch between rendered preview and source code

Install, Runtime & Hardware

  • CUDA / Windows
    • CUDA 13.3 llama.cpp binaries now work on Windows (and other non-Linux platforms) and fix the CUDA 13.2 gibberish-output bug; the default still pins to CUDA 13.1 for now
    • On CUDA 13.2, 13.1 and below, Windows falls back to CUDA 12.4; native 13.1 binaries are coming soon
    • Windows prebuilt installs no longer block on the early CUDA Toolkit check
  • Linux / GPU
    • Linux llama.cpp prebuilts now match your runtime's cudart major version
    • Prebuilt coverage for Blackwell (with a CUDA 13.0 driver fallback) and B300 (sm_103)
    • ARM64 Linux now source-builds on GPU hosts, with a CPU prebuilt fallback
    • ROCm: detected AMD gfx arch is forwarded to the prebuilt installer (with a setup.sh fallback)
  • macOS
    • Fixed Apple Silicon installs that were resolving torch against x86_64
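
To illustrate what "match your runtime's cudart major version" can mean in practice, here is a minimal Python sketch. It is not the installer's actual logic: the catalogue AVAILABLE_PREBUILTS, the helper names cuda_major and pick_prebuilt, and the rule of preferring the newest build within a major version are all assumptions made for illustration.

```python
import re
from typing import List, Optional

# Hypothetical catalogue: the CUDA version each prebuilt was compiled against.
AVAILABLE_PREBUILTS = ["12.4", "13.1", "13.3"]


def cuda_major(version: str) -> int:
    """Return the major component of a version string such as '13.1'."""
    match = re.match(r"\s*(\d+)", version)
    if match is None:
        raise ValueError(f"not a CUDA version: {version!r}")
    return int(match.group(1))


def pick_prebuilt(runtime_cudart: str, available: List[str]) -> Optional[str]:
    """Pick a prebuilt whose CUDA major equals the runtime's cudart major.

    Assumption: among matching builds, the newest one is preferred.
    Returns None when no build shares the major version, leaving the
    fallback (for example a source build) to the caller.
    """
    wanted = cuda_major(runtime_cudart)
    candidates = [v for v in available if cuda_major(v) == wanted]
    if not candidates:
        return None
    # Compare versions numerically, not as strings.
    return max(candidates, key=lambda v: tuple(int(p) for p in v.split(".")))
```

With this sketch, a runtime reporting cudart 12.8 would select the 12.4 build, while a runtime with no matching major version returns None so that a fallback path can take over.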

Other Studio improvements

  • Connected models now work in Compare mode
  • Smoother streaming: rendering is now batched to one update per animation frame
  • Larger upload limits for training datasets and recipes
  • Window size and maximized state persist across launches
  • Chat search hides non-matching threads
  • Model loading handles mid-refresh cancellation cleanly
  • Cleaner rendering for generated image frames and Python tool code blocks
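
The per-frame batching mentioned for streaming can be pictured with a small Python simulation. This is only a sketch of the general technique, not Studio's frontend code: the FrameBatcher class and its push and on_frame methods are hypothetical names, and a real UI would drive on_frame from its own animation-frame callback.

```python
from typing import Callable, List


class FrameBatcher:
    """Buffer streamed chunks and render them at most once per frame."""

    def __init__(self, render: Callable[[str], None]) -> None:
        self._render = render
        self._pending: List[str] = []

    def push(self, chunk: str) -> None:
        # Called for every streamed chunk; only appends to the buffer.
        self._pending.append(chunk)

    def on_frame(self) -> None:
        # Called once per animation frame by the host UI loop (assumed).
        if not self._pending:
            return  # nothing new arrived since the last frame
        text = "".join(self._pending)
        self._pending.clear()
        self._render(text)


# Usage: four chunks arrive between two frames, but render runs only once.
rendered: List[str] = []
batcher = FrameBatcher(rendered.append)
for chunk in ["Hel", "lo", ", ", "world"]:
    batcher.push(chunk)
batcher.on_frame()  # flushes the four buffered chunks in one render call
batcher.on_frame()  # buffer is empty, so no render call happens
```

The point of the design is that the cost of rendering scales with the frame rate and not with the number of streamed chunks.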

What's Changed

Full Changelog: v0.1.43-beta...v0.1.44-beta
