github jegly/Box v1.0.6

one month ago
04-phosphor

### Box v1.0.6

### New: On-Device Image Generation

First stable release of on-device image generation. Previous builds had crashes and instability in the Stable Diffusion
pipeline — v1.0.6 resolves these and is the first version we'd consider ready for daily use.

  • Image generation time dropped from ~27 minutes to under 4 minutes. Earlier builds required 20 steps with base SD 1.5;
    LCM-SSD-1B (recommended) produces quality results in just 4 steps thanks to consistency distillation. The rest of the
    speed-up comes from the CPU optimisations listed under Performance.
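
As a rough back-of-the-envelope check, the figures above can be split into a step-count effect and everything else. This is only a sketch: it assumes a 4-step run would cost the same per step as the old 20-step SD 1.5 run, which is not strictly true because LCM-SSD-1B is a different model, and it ignores fixed costs such as model loading and image decoding. The constant names are illustrative.

```python
# Figures quoted in the release notes (approximate).
OLD_TOTAL_MIN = 27.0   # "~27 minutes" with base SD 1.5
OLD_STEPS = 20
NEW_STEPS = 4          # LCM-SSD-1B
NEW_TOTAL_MIN = 4.0    # "under 4 minutes", treated here as an upper bound


def per_step_minutes(total_min, steps):
    """Average minutes per sampling step, ignoring fixed overhead."""
    return total_min / steps


old_per_step = per_step_minutes(OLD_TOTAL_MIN, OLD_STEPS)

# Hypothetical: time for 4 steps if each step still cost what it used to.
steps_only_estimate = old_per_step * NEW_STEPS

# Overall speed-up implied by the quoted before/after times.
overall_speedup = OLD_TOTAL_MIN / NEW_TOTAL_MIN

print(f"old per-step cost:   {old_per_step:.2f} min")
print(f"4 steps at old cost: {steps_only_estimate:.1f} min")
print(f"overall speed-up:    at least {overall_speedup:.2f}x")
```

Under these assumptions, cutting the step count alone would give roughly 5.4 minutes, so the reported "under 4 minutes" suggests the build and kernel changes contribute on top of the model switch.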

  • Added Image Gen — generate images from text prompts fully on-device using Stable Diffusion
  • 6 models available to download, powered by stable-diffusion.cpp:
    • LCM-SSD-1B Q4_K (~2.2 GB) — recommended, fast SDXL-class results in 4 steps
    • SDXL-Lightning 4-step Q4_0 (~2.8 GB) — ByteDance distillation, high quality in 4 steps
    • SDXL-Turbo Q4_0 (~4.2 GB) — vivid results in 1–4 steps
    • SDXL Base Q4_0 (~3.9 GB) — full SDXL at native 1024² resolution
    • SD 1.5 Q4_0 (~2.1 GB) — classic reliable all-rounder
    • SD 1.5 Q8_0 (~4.0 GB) — higher precision SD 1.5
  • Adjustable steps, CFG scale, negative prompt, and image size (256² up to 1024²)
  • Save generated images directly to your gallery
  • Import your own GGUF model files from device storage
  • Fixed crash (SIGSEGV) on second generation — was a use-after-free in the sd.cpp context; context is now reloaded before each
    generation
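
The crash fix in the last bullet follows a common pattern: create a fresh native context for every generation instead of reusing one that may already have been released. The sketch below illustrates the idea in Python. `FakeContext` and `Generator` are hypothetical stand-ins, not the stable-diffusion.cpp API or Box's actual code, and the `freed` flag only simulates what would be a real use-after-free in native code.

```python
class FakeContext:
    """Hypothetical stand-in for a native inference context."""

    def __init__(self, model_path):
        self.model_path = model_path
        self.freed = False

    def generate(self, prompt, steps):
        if self.freed:
            # In native code this would be undefined behaviour (e.g. SIGSEGV).
            raise RuntimeError("context used after free")
        return f"image({prompt!r}, steps={steps})"

    def free(self):
        self.freed = True


class Generator:
    """Creates a new context for each call, so no freed context is reused."""

    def __init__(self, model_path, context_factory=FakeContext):
        self.model_path = model_path
        self._factory = context_factory

    def generate(self, prompt, steps=4):
        ctx = self._factory(self.model_path)  # fresh context every time
        try:
            return ctx.generate(prompt, steps)
        finally:
            ctx.free()  # released here, and never touched again


gen = Generator("model.gguf")
first = gen.generate("a red fox")
second = gen.generate("a blue bird")  # second run gets its own context
```

The trade-off of this approach is that the model is loaded again for each generation, which presumably adds some start-up time per image in exchange for stability.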

### Performance

  • Enabled ARM dotprod + FP16 CPU kernels for all ggml-based inference (Stable Diffusion, SmolLM, llama.cpp); these were silently disabled
    at build time due to cross-compilation defaults
  • Upgraded ggml release builds from -O2 to -O3 optimisation
  • Enabled flash attention in Stable Diffusion — reduces memory pressure during sampling
  • Compiled in experimental Vulkan GPU backend for Stable Diffusion (auto-falls back to CPU if unsupported)
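
Whether a device can actually use dotprod and FP16 kernels depends on its CPU. On Linux-based arm64 systems the kernel reports these capabilities in the `Features` line of `/proc/cpuinfo` as `asimddp` (dot product) and `fphp` / `asimdhp` (half-precision). The sketch below parses that text. The function names are illustrative, and the check only covers hardware support at runtime, not whether a given build enabled the kernels, which was the build-time issue described above.

```python
def arm_features(cpuinfo_text):
    """Collect CPU feature flags from /proc/cpuinfo-style text."""
    flags = set()
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "features":
            flags.update(value.split())
    return flags


def fast_kernel_support(cpuinfo_text):
    """Report hardware support for dot-product and FP16 arithmetic."""
    flags = arm_features(cpuinfo_text)
    return {
        "dotprod": "asimddp" in flags,
        "fp16": {"fphp", "asimdhp"} <= flags,
    }


# Sample text in the format the kernel uses; on a device you would read
# the real file instead: open("/proc/cpuinfo").read()
SAMPLE = (
    "processor\t: 0\n"
    "Features\t: fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics "
    "fphp asimdhp cpuid asimdrdm lrcpc dcpop asimddp\n"
)

support = fast_kernel_support(SAMPLE)
print(support)
```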

### Voice Input / Audio Scribe

  • Removed Whisper Tiny — was hanging on transcription; Whisper Base is now the smallest option

### General

  • Model cards now show descriptions when expanded — tap any model card to see details and recommended settings
  • All Image Gen model cards include plain-English descriptions and recommended CFG/step settings

Two variants available:

  • Box_v1.0.6_Main_Signed_Release.apk: stock Android 15+
  • Box_v1.0.6_custom-rom-support_Signed_Release.apk: GrapheneOS / custom ROMs without Google Play Services

APK size reduced from ~890 MB to ~550 MB — native libraries are now compiled for arm64 only, dropping unused x86 and 32-bit
ARM builds. All devices that can run Box (Android 15+) are arm64.
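
For a sense of scale, the size change can be expressed as a percentage. This is a small worked calculation using the approximate figures quoted above; the helper name is illustrative.

```python
def percent_reduction(before, after):
    """Relative size reduction as a percentage of the original size."""
    return (before - after) / before * 100.0


# Approximate APK sizes in MB, as quoted in the release notes.
apk_saving = percent_reduction(890, 550)
print(f"{apk_saving:.1f}%")  # prints "38.2%"
```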
