github ggml-org/llama.cpp b9414

2 hours ago

mtmd: Add DeepSeekOCR 2 Support (#20975)

  • mtmd: DeepSeek-OCR 2 support, with multi-tile dynamic resolution

  • introduced clip_image_f32::add_viewsep

  • address PR review

  • drop redundant ggml_cpy ops in both deepseekocr versions build
  • drop no-op ggml_cont in build_sam
  • assert num_image_tokens deepseekocr2
  • view_seperator as (1, n_embd) at conversion (for both versions)
  • drop redundant ggml_reshape_2d
  • Update tools/mtmd/models/deepseekocr2.cpp

Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>



openEuler:

  • DISABLED
  • openEuler x86 (310p)
  • openEuler x86 (910b, ACL Graph)
  • openEuler aarch64 (310p)
  • openEuler aarch64 (910b, ACL Graph)

