github Mozilla-Ocho/llamafile 0.8.17
llamafile v0.8.17

17 days ago

llamafiler has a new web UI which supports two modes of operation:
chatbot and raw completion. Its syntax highlighting is just as advanced
as the CLI chatbot. It looks much nicer than the old web ui. In a future
release, llamafiler will be folded into llamafile to replace the old server.

  • 988c9ec Introduce raw completions web ui
  • 241bf21 Introduce /v1/completions endpoint in new server
  • 6d89f8f Add binary safety check to server
  • d18ddf1 Add redo button to new web ui
  • bc82424 Add settings modal to web ui
  • bb917bd Add vision model support to new server
  • 4c7b7d5 Implement data URI parser
  • fb4b3e6 Fix JSON parser bug
  • 9d6f89f Improve look and printability of new web ui
  • 25b6910 Make chatbot ui more printer friendly
  • 30518ca Respond to HTTP OPTIONS requests
  • 41abfa3 Work around multiple image handling
  • 35bc088 Make default system prompt configurable on web
  • 28c8e22 Scale and decimate images as needed in browser
  • 14713b5 Get basic chatbot web gui working in llamafiler
  • ef08074 Start porting syntax highlighter to JavaScript
  • fdfdb13 Port remaining highlighting code to javascript

The following improvements have been made to our terminal chatbot.

  • 12c3761 Make CLI chatbot work better with base models
  • e5c0921 Improve VT100 support
  • 4b61791 Fix VT102 support
  • d25c077 Introduce /upload and /forget commands to chatbot
  • 880ebc7 Handle empty system prompt better in cli chatbot

General improvements to this project.

  • f581c40 Fix futex prototype
  • 54d3c72 Make LLaVA fast again
  • 01b8d49 Remove n-gpu-layer limitation (#534)
  • 566cdc1 Improve Gemma system prompt generation
  • 46284fe Reduce attack surface of stb_image
  • 9bb262b Log CUDA kernel vs. runtime versions

Syntax highlighting improvements for chatbot and web ui.

  • d979a1c Add BNF syntax highlighting
  • 4a8311a Add cmake syntax highlighting
  • 40e92cf Add Ocaml syntax highlighting
  • 0995343 Add more Clojure keywords
  • 0068a37 Make D syntax highlighting better
  • 0965a4b Make some markdown improvements
  • 9b96502 Improve JS/HTML syntax highlighting
  • c0622da Put more work into markdown rendering
  • fa1c98f Improve markdown to html rendering
  • 8915432 Further improve markdown to html
  • d25fa3a Improve highlighting in new web ui
  • f5a0bd4 Fix JS regex highlighting issue
  • 2807ae6 Improve Ada syntax highlighting
  • d30da30 Syntax highlight D properly
  • 33a057e Improve Ruby some more
  • 5b0fff1 Improve Ruby syntax highlighting
  • 8413a21 Fix Ruby builtins in web gui

The latest cosmopolitan upgrade introduces a new more powerful syntax
for your .args files. They're now parsed more similarly to the shell,
with support for C style escaping in double-quoted strings. You can also
now add shell-style comments to .args files too. See tool/args/args2.c
in the cosmopolitan codebase for the definitive reference.

  • fb59488 Upgrade to Cosmo v3.9.7
  • 21af0bf Import upstream bestline changes

The following example of the new .args file syntax is provided:

# specify model
-m Qwen2.5-Coder-34B-Instruct.Q6_K.gguf

# prevent flags below from being changed
...

# specify system prompt
--system-prompt "\
you are a friendly ai assistant\n
your job is to be helpful and intelligent"

# hide some stuff from user interfaces
--nologo
--no-display-prompt

You can put .args files inside llamafile, llamafiler, and whisperfile
using the zipalign program.

The following screenshots are provided of the llamafiler web ui.

image

image

image

image

Don't miss a new llamafile release

NewReleases is sending notifications on new releases.