### Box v1.0.6
### New: On-Device Image Generation
First stable release of on-device image generation. Previous builds had crashes and instability in the Stable Diffusion
pipeline — v1.0.6 resolves these and is the first version we'd consider ready for daily use.
- Image generation time dropped from ~27 minutes to under 4 minutes. Earlier builds needed 20 steps with base SD 1.5; the
recommended LCM-SSD-1B model produces good results in just 4 steps thanks to consistency distillation, which, combined with
the CPU optimisations listed under Performance, accounts for the speed-up
- Added Image Gen — generate images from text prompts fully on-device using Stable Diffusion
- 6 models available to download, powered by stable-diffusion.cpp:
  - LCM-SSD-1B Q4_K (~2.2 GB): recommended, fast SDXL-class results in 4 steps
  - SDXL-Lightning 4-step Q4_0 (~2.8 GB): ByteDance distillation, high quality in 4 steps
  - SDXL-Turbo Q4_0 (~4.2 GB): vivid results in 1–4 steps
  - SDXL Base Q4_0 (~3.9 GB): full SDXL at native 1024² resolution
  - SD 1.5 Q4_0 (~2.1 GB): classic, reliable all-rounder
  - SD 1.5 Q8_0 (~4.0 GB): higher-precision SD 1.5
- Adjustable steps, CFG scale, negative prompt, and image size (256² up to 1024²)
- Save generated images directly to your gallery
- Import your own GGUF model files from device storage
- Fixed a crash (SIGSEGV) on the second generation: it was caused by a use-after-free in the sd.cpp context, which is now
reloaded before each generation
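
For readers curious about the shape of the use-after-free fix, here is a minimal C++ sketch of the "reload the context before each generation" pattern. It is an illustration under stated assumptions, not Box's actual code: `FakeContext` and `ImageGenerator` are hypothetical names, and `FakeContext` only stands in for the native sd.cpp context rather than using the real stable-diffusion.cpp API.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

// Hypothetical stand-in for a native inference context.
// This is NOT the stable-diffusion.cpp API; it only models "a context that
// must not be reused after a generation has run on it".
struct FakeContext {
    explicit FakeContext(std::string path) : model_path(std::move(path)) {}
    std::string model_path;
    bool used = false;  // set once the context has produced an image
};

// Hypothetical wrapper that owns the context and rebuilds it for every run.
class ImageGenerator {
public:
    explicit ImageGenerator(std::string model_path)
        : model_path_(std::move(model_path)) {}

    // Rebuild the context before each generation. Assigning to the
    // unique_ptr destroys the previous context first, so nothing left over
    // from an earlier run can be touched again.
    std::string generate(const std::string& prompt) {
        ctx_ = std::make_unique<FakeContext>(model_path_);
        ++loads_;
        assert(!ctx_->used);  // a fresh context has never been used
        ctx_->used = true;
        return "image for: " + prompt;
    }

    // Number of times a context has been (re)loaded.
    int loads() const { return loads_; }

private:
    std::string model_path_;
    std::unique_ptr<FakeContext> ctx_;
    int loads_ = 0;
};
```

Calling `generate` twice on one `ImageGenerator` loads the context twice. The trade-off is a model reload on every run in exchange for never touching state left behind by the previous one.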
### Performance
- Enabled ARM dotprod + FP16 CPU kernels for all ggml-based inference (Stable Diffusion, SmolLM, llama.cpp); these were
silently disabled at build time due to cross-compilation defaults
- Upgraded ggml release builds from -O2 to -O3 optimisation
- Enabled flash attention in Stable Diffusion — reduces memory pressure during sampling
- Compiled in experimental Vulkan GPU backend for Stable Diffusion (auto-falls back to CPU if unsupported)
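
Silently disabled CPU kernels of this kind can be caught with a compile-time check. The sketch below is one way to do that, assuming a GCC or Clang toolchain that follows the Arm C Language Extensions: those compilers define `__ARM_FEATURE_DOTPROD` and `__ARM_FEATURE_FP16_VECTOR_ARITHMETIC` when the target flags enable the corresponding instructions. The struct and function names are hypothetical, and the exact flags used in Box's build are not stated in these notes.

```cpp
#include <cassert>
#include <string>

// Which optional ARM CPU features this translation unit was compiled with.
struct CpuKernelFlags {
    bool dotprod;      // integer dot-product instructions (SDOT/UDOT)
    bool fp16_vector;  // half-precision vector arithmetic
};

// Reads the ACLE feature-test macros set by the compiler for the current
// target. On non-ARM targets, or when the build flags omit these features,
// both fields stay false.
inline CpuKernelFlags compiled_kernel_flags() {
    CpuKernelFlags flags{false, false};
#if defined(__ARM_FEATURE_DOTPROD)
    flags.dotprod = true;
#endif
#if defined(__ARM_FEATURE_FP16_VECTOR_ARITHMETIC)
    flags.fp16_vector = true;
#endif
    return flags;
}

// Human-readable summary, e.g. for logging once at startup.
inline std::string describe_kernel_flags() {
    const CpuKernelFlags f = compiled_kernel_flags();
    return std::string("dotprod=") + (f.dotprod ? "on" : "off") +
           " fp16=" + (f.fp16_vector ? "on" : "off");
}
```

Logging `describe_kernel_flags()` from the native library at startup would make a build that lost these features visible straight away, instead of showing up only as slow inference.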
### Voice Input / Audio Scribe
- Removed Whisper Tiny — was hanging on transcription; Whisper Base is now the smallest option
### General
- Model cards now show descriptions when expanded — tap any model card to see details and recommended settings
- All Image Gen model cards include plain-English descriptions and recommended CFG/step settings
Two variants available:
- Box_v1.0.6_Main_Signed_Release.apk: stock Android 15+
- Box_v1.0.6_custom-rom-support_Signed_Release.apk: GrapheneOS / custom ROMs without Google Play Services
APK size reduced from ~890 MB to ~550 MB — native libraries are now compiled for arm64 only, dropping unused x86 and 32-bit
ARM builds. All devices that can run Box (Android 15+) are arm64.