github ggml-org/llama.cpp b8670

latest release: b8671
3 hours ago
Details

model : add HunyuanOCR support (#21395)

  • HunyuanOCR: add support for text and vision models
  • Add HunyuanOCR vision projector (perceiver-based) with Conv2d merge
  • Add separate HUNYUAN_OCR chat template (content-before-role format)
  • Handle HunyuanOCR's invalid pad_token_id=-1 in converter
  • Fix EOS/EOT token IDs from generation_config.json
  • Support xdrope RoPE scaling type
  • Add tensor mappings for perceiver projector (mm.before_rms, mm.after_rms, etc.)
  • Register HunYuanVLForConditionalGeneration for both text and mmproj conversion
  • fix proper mapping

  • Update gguf-py/gguf/tensor_mapping.py

Co-authored-by: Xuan-Son Nguyen thichthat@gmail.com

  • Update tools/mtmd/clip.cpp

Co-authored-by: Xuan-Son Nguyen thichthat@gmail.com

  • address comments

  • update

  • Fix typecheck

  • Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

  • Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com


Co-authored-by: Xuan-Son Nguyen thichthat@gmail.com
Co-authored-by: Sigbjørn Skjæret sigbjorn.skjaeret@scala.com

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.