Changes
- Create llamacpp_HF loader by @oobabooga in #3062
- Make it possible to evaluate exllama perplexity by @oobabooga in #3138
- Add support for logits processors in extensions by @cyberfox in #3029
- Bump bitsandbytes to 0.40.1.post1 by @jllllll in #3156
- Bump llama cpp version by @ofirkris in #3160
- Increase alpha value limit for NTK RoPE scaling for exllama/exllama_HF by @Panchovix in #3149
- Decrease download timeout
Bug fixes
- Fix reload screen background color in dark mode
Extensions
- Color tokens by probability and/or perplexity by @SeanScripts in #3078