Box
Box's first major-version release. The headline is hardware NPU/TPU acceleration coming to
life — the Google Tensor G5 (Pixel 10) and MediaTek neural engines now run models on-device,
joined by a batch of new models, an on-device Face Recognition feature, and a brighter,
fully theme-aware interface.
⚡ Hardware acceleration
- Google Tensor G5 (Pixel 10) acceleration (new). Gemma now runs on the Pixel 10's Tensor
G5 neural engine, not just the GPU. Supported models route to the TPU automatically on Pixel 10
and expose a dedicated TPU option in the accelerator picker. - MediaTek NPU support (new). Box now ships the runtime and the first models that run on
MediaTek Dimensity neural engines — the first MediaTek NPU models in Box. - Snapdragon & Tensor NPU packaging fix. Native NPU/DSP libraries are now correctly unpacked
on install, so NPU models initialize reliably instead of failing to find their runtime.
🤖 New models
- Gemma 4 E2B (Tensor G5) — NPU-optimized build for the Pixel 10. Multimodal (text + image +
audio), 32K context, thinking. - Gemma 4 12B — large Gemma 4 model (GPU), text, 32K context. For high-memory devices.
- Gemma 3 1B-IT (Tensor G5) — compact Pixel 10 NPU model.
- Gemma 3n E2B (MediaTek) — multimodal (image + audio) NPU model for MediaTek devices.
- Qwen3 0.6B (MediaTek) — compact MediaTek NPU model.
🧑 Face Recognition (new, on-device & private)
- A new Face Recognition feature (in the Image section): detect, enroll and name people, then
recognize them in photos or live from the camera — fully offline. - Enroll multiple samples per person for better accuracy; faces are aligned before matching for
reliable results. - Capture-to-add from the live camera, an on-screen face mesh overlay, and a settings
panel (match strictness, front-camera default, show match %, clear all). - All face data is encrypted on-device and never leaves the phone — opt-in and user-enrolled
only, with no external or web-scraped identity database.
🎨 Interface & themes
- New Light theme. A crisp, truly-bright light mode that doesn't tint toward your wallpaper —
for people who found the dynamic light theme not light enough. - Theme-aware home screen. The home background now follows your selected theme (System /
Light / Catppuccin / Dracula) instead of always being black, with a subtle accent tint so each
theme is visibly distinct. - Fixed the theme picker squeezing the "Catppuccin" label.
🤝 Gemini Nano (stock & custom-ROM)
- Gemini Nano Hub now on the custom-ROM build too — Summarize / Proofread / Rewrite /
Describe / Chat / Speech, previously stock-only. (Degoogled devices without AICore degrade
gracefully; ML-Kit vision features still work.) - Fixed a crash when attaching a document in Summarize/Proofread/Rewrite (the file picker
could be hijacked into the photo picker on Android 14+). Documents now open through the proper
document picker, with a clean fallback if none is available. - Fixed memory/service leaks when moving between Nano features.
💬 Chat improvements
- Copy button on code blocks. Fenced code blocks in chat now render with a language label and
a one-tap Copy code button. - Speech-to-text with SenseVoice in Chat. The chat mic now works with a loaded SenseVoice
model (priority: Whisper → SenseVoice → system) — no longer dead-ends asking for Whisper. - Speculative decoding is available for Gemma 4 in chat (off by default).
- Gemma Terms of Use. A one-time consent dialog (linking the Gemma Terms of Use and Google's
Prohibited Use Policy) now appears before downloading any Gemma model.
🛠️ Fixes & under the hood
- Agent mode now works with text-only models (#69). Agent mode no longer force-loads vision
on models that don't support it, which previously blocked text-only imported models entirely. - Correct installed version (#67). Fixes Obtainium / "App version" showing the wrong version
(and "update available" loops) by aligning the internal and public version numbers — the reason
for the 2.0.0 jump. - Large GPU models fall back to CPU instead of dead-ending when they exceed the GPU's limits.
- Gemma 4 12B initialization fix — loads correctly on GPU instead of failing with a
misleading backend/audio-adapter error.
Build variants
- Main (stock Android): includes the Gemini Nano Hub & Vision features (AICore modes need a
supported device; ML-Kit vision runs broadly). - custom-rom-support (GrapheneOS / LineageOS / CalyxOS): now also includes the Gemini Nano
Hub; no Google-services-dependent features required.