github ggml-org/llama.cpp b7583

latest releases: b7976, b7974, b7973...
one month ago
Details

lora: count lora nodes in graph_max_nodes (#18469)

  • lora: count lora nodes in graph_max_nodes

  • 3 nodes per weight

  • 4 nodes

  • keep track n_lora_nodes from llama_model

  • fix assert

  • rm redundant header

  • common: load adapters before context creation

  • use 6 nodes

macOS/iOS:

Linux:

Windows:

openEuler:

Don't miss a new llama.cpp release

NewReleases is sending notifications on new releases.