Automated Android build artifacts for commit 8d71e829d575daf5788cc51d479a3cf98b24c340.
Workflow run: https://github.com/LettuceAI/app/actions/runs/26717152674
Changes since previous Android build android-dev-226-1-07a286a
Compare: android-dev-226-1-07a286a...8d71e82
8d71e82docs(changelog): add llama.cpp mixed-offload context sizing and VRAM headroom fixesb57cee6fix(chat): keep configured bubble width when header-above or message-info row is shown11760d0fix(llama): account for GPU-offloaded weights when sizing context for mixed offloadb49e338refactor(llama-cpp): derive compute-buffer VRAM reserve from model dims and batch97d552ffix(llama-cpp): retry smaller context on OOM even when an explicit KV type is set1f23950fix(llama-cpp): reserve compute-buffer VRAM by context and stop offload from exceeding the safe layer estimate1086fb9docs(changelog): clarify per-message info shows the generating model72029bdfix(chat-appearance): persist per-message model id so info shows the generating model, not the current oneba538dbfix(chat-appearance): only show per-message info on assistant and scene messages