github ggml-org/llama.cpp b7414

latest releases: b7423, b7422, b7418...
11 hours ago

Warning

Release Format Update: Linux releases will soon use .tar.gz archives instead of .zip. Please make the necessary changes to your deployment scripts.

Details

model : add KORMo model (#18032)

  • vocab: add KORMo Tokenizer

  • model: add KORMoForCausalLM

  • vocab: change pretokenizer to qwen2

  • lint: fix unintended line removal

  • model: make qwen2 bias tensor optional

  • model: use qwen2 architecture for KORMo

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.