koboldcpp-1.6
- This is a bugfix release that attempts to resolve the recently reported crashes.
- Recent CLBlast fixes merged, now shows GPU name.
- Batch size reduced back from 1024 to 512 due to reported crashes.
To use, download and run koboldcpp.exe, which is a one-file PyInstaller build.
Alternatively, drag and drop a compatible ggml model on top of the .exe, or run it and manually select the model in the popup dialog.
Once the model has loaded, you can connect like this (or use the full KoboldAI client):
http://localhost:5001
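If the page does not open, it can help to check whether anything is listening on that port yet. The following is a minimal sketch in Python, not part of koboldcpp itself; the helper names `base_url` and `is_listening` are made up for illustration, and it only tests that a TCP connection to localhost:5001 succeeds, not that the model has finished loading.

```python
import socket

HOST = "localhost"
PORT = 5001  # port taken from the URL in these notes


def base_url(host=HOST, port=PORT):
    """Build the address to open in a browser or client."""
    return f"http://{host}:{port}"


def is_listening(host=HOST, port=PORT, timeout=2.0):
    """Return True if something accepts TCP connections at host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host not resolvable.
        return False


if __name__ == "__main__":
    state = "accepting connections" if is_listening() else "not reachable"
    print(f"{base_url()} is {state}")
```

A result of "not reachable" usually just means the program has not been started or is still loading.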
For more information, run the program with the --help flag.
Alternative Options:
The non-AVX2 version is now included in the same .exe file; enable it with the --noavx2 flag.
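For anyone launching the program from a script rather than by double-clicking, the flags mentioned above can be assembled as shown in this rough sketch. The helper `build_command` is hypothetical, it uses only the two flags named in these notes, and it assumes koboldcpp.exe sits in the current directory.

```python
def build_command(exe="koboldcpp.exe", noavx2=False, show_help=False):
    """Assemble an argument list using only the flags named in these notes."""
    cmd = [exe]
    if noavx2:
        cmd.append("--noavx2")  # use the bundled non-AVX2 build
    if show_help:
        cmd.append("--help")  # print the available options
    return cmd


if __name__ == "__main__":
    # Print the command line instead of running it.
    print(" ".join(build_command(noavx2=True)))
```

The resulting list can be passed to `subprocess.run` from the standard library to start the program.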