Details
mtmd: Add dynamic high-resolution image preprocessing for InternVL model (#20847)
-
added support for internvl's dynamic high-resolution (Qianfan-OCR needed)
-
add min/max dynamic patch to gguf meta
-
clean up
-
simplified handling min/max dynamic patch
-
reuse llava_uhd logic for slice images
-
provide default values for older models
-
flake8
-
prevent writing 0 value to gguf
-
remove duplicated resolution candidates with a better algorithm
-
fix indentation
-
format
-
add protection from divide by zero
-
change to 0 to be safe
Co-authored-by: Xuan Son Nguyen son@huggingface.co
macOS/iOS:
Linux:
Windows:
- Windows x64 (CPU)
- Windows arm64 (CPU)
- Windows x64 (CUDA 12) - CUDA 12.4 DLLs
- Windows x64 (CUDA 13) - CUDA 13.1 DLLs
- Windows x64 (Vulkan)
- Windows x64 (SYCL)
- Windows x64 (HIP)
openEuler: