What's Changed
- konflux: release images when a tag is pushed to the git repo by @mikebonnet in #1926
- konflux: build ramalama images for s390x and ppc64le by @mikebonnet in #1842
- konflux: run clamav-scan as a matrixed task by @mikebonnet in #1922
- s390x: switch to a smaller bigendian model for testing by @mikebonnet in #1930
- --flash-attn requires an option in llama-server now by @rhatdan in #1928
- Pass the encoding argument to run_cmd(). by @jwieleRH in #1931
- Improve NVIDIA CDI check. by @jwieleRH in #1903
- [ci] Update repo for ubuntu podman 5 by @olliewalsh in #1940
- chore(deps): update konflux references by @red-hat-konflux-kflux-prd-rh03[bot] in #1932
- chore(deps): update dependency huggingface-hub to ~=0.35.0 by @renovate[bot] in #1935
- fix: with the new llama.cpp version and chat templates rag_framework … by @bmahabirbu in #1937
- Introduce tox for testing and add e2e framework by @telemaco in #1938
- docs: revert incorrect docs changes by @cdoern in #1936
- Add bats test to cover docker-compose in serve by @abhibongale in #1934
- konflux: set the source-repo-url annotation on the override Snapshot by @mikebonnet in #1941
- Add Compose docs by @abhibongale in #1943
- chore(deps): update registry.access.redhat.com/ubi9/ubi docker tag to v9.6-1758184894 by @renovate[bot] in #1944
- Adds a roadmap document for tracking future work and goals by @ieaves in #1893
- konflux: handle "incoming" events when creating override Snapshots by @mikebonnet in #1945
- added mcp to chat by @bmahabirbu in #1923
- introduced some qol fixed for standard python mcp client by @bmahabirbu in #1953
- chore(deps): update konflux references by @red-hat-konflux-kflux-prd-rh03[bot] in #1954
- Add e2e pytest workflows to Github CI by @telemaco in #1950
- Add e2e pytest test for bench command by @telemaco in #1942
- Reorganize transports and add new rlcr transport option by @ieaves in #1907
- Bump to v0.12.3 by @rhatdan in #1956
New Contributors
Full Changelog: v0.12.2...v0.12.3