What's Changed (this repo branch)
Sync to Ollama main v0.7.1
What's Changed (from Ollama)
- Improved model memory management to allocate sufficient memory to prevent crashes when running multimodal models in certain situations
- Enhanced memory estimation for models to prevent unintended memory offloading
ollama showwill now show ... when data is truncated- Fixed crash that would occur with
qwen2.5vl - Fixed crash on Nvidia's CUDA for
llama3.2-vision - Support for Alibaba's Qwen 3 and Qwen 2 architectures in Ollama's new multimodal engine
New Contributors
- @ronxldwilson made their first contribution in ollama#10763
- @DarkCaster made their first contribution in ollama#10779
Full Changelog: v0.7.0...v0.7.1