What's Changed (this repo branch)
Sync to Ollama main v0.6.8
What's Changed (from Ollama)
What's Changed
- Performance improvements for Qwen 3 MoE models (
30b-a3band235b-a22b) on NVIDIA and AMD GPUs - Fixed
GGML_ASSERT(tensor->op == GGML_OP_UNARY) failedissue caused by conflicting installations - Fixed a memory leak that occurred when providing images as input
ollama showwill now correctly label older vision models such asllava- Reduced out of memory errors by improving worst-case memory estimations
- Fix issue that resulted in a
context cancelederror
New Contributors
- @AliAhmedNada made their first contribution in ollama#10522
- @HarshNevse made their first contribution in ollama#10465
Full Changelog: v0.6.7...v0.6.8