github ggml-org/llama.cpp b8477


mtmd: Add dynamic high-resolution image preprocessing for InternVL model (#20847)

  • add support for InternVL's dynamic high-resolution preprocessing (needed for Qianfan-OCR)

  • add min/max dynamic patch to GGUF metadata

  • clean up

  • simplify handling of min/max dynamic patch

  • reuse llava_uhd logic for slicing images

  • provide default values for older models

  • flake8

  • prevent writing a 0 value to GGUF

  • remove duplicated resolution candidates with a better algorithm

  • fix indentation

  • format

  • add protection from divide by zero

  • change to 0 to be safe
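
For readers unfamiliar with the technique, dynamic high-resolution preprocessing picks a tile grid whose aspect ratio best matches the input image, bounded by a minimum and maximum tile count. The following is a minimal sketch of that selection step, not the code from this PR: the function names `candidate_grids` and `pick_grid`, the defaults (1, 12, 448), and the tie-break rule are assumptions modeled loosely on InternVL's published reference preprocessing.

```python
from math import inf


def candidate_grids(min_patches: int, max_patches: int) -> list[tuple[int, int]]:
    """Hypothetical helper: list each (cols, rows) grid once, with
    min_patches <= cols * rows <= max_patches, smallest grids first."""
    grids = [
        (cols, rows)
        for cols in range(1, max_patches + 1)
        for rows in range(1, max_patches // cols + 1)
        if cols * rows >= min_patches
    ]
    return sorted(grids, key=lambda g: (g[0] * g[1], g[0]))


def pick_grid(width: int, height: int,
              min_patches: int = 1, max_patches: int = 12,
              tile_size: int = 448) -> tuple[int, int]:
    """Hypothetical helper: choose the grid whose aspect ratio is closest
    to the image's. Defaults are assumed values, not read from any model."""
    # Guard against degenerate input so the divisions below are safe.
    if width <= 0 or height <= 0 or max_patches < 1:
        return (1, 1)

    aspect = width / height
    area = width * height
    best, best_diff = (1, 1), inf

    for cols, rows in candidate_grids(min_patches, max_patches):
        diff = abs(aspect - cols / rows)
        if diff < best_diff:
            best, best_diff = (cols, rows), diff
        elif diff == best_diff:
            # Assumed tie-break: move to the larger grid only when the
            # image has enough pixels to fill at least half of its tiles.
            if area > 0.5 * tile_size * tile_size * cols * rows:
                best = (cols, rows)
    return best
```

Under these assumptions, an 896x448 image maps to a 2x1 grid, and the image would then be resized to `cols * tile_size` by `rows * tile_size` and cut into tiles. Enumerating `rows` only up to `max_patches // cols` yields each grid exactly once, so no separate de-duplication pass is needed.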


Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
