🚀 Features
👷 Built-in distribution support: running completion / chat model worker on different process / machine.

The community edition has a restriction of a maximum of one worker for code completion/chat. The enterprise edition is still in private alpha and is only available to our design partners. If you're interested, please DM Meng Zhang
on the Slack channel to apply (limited slots available).
💬 Conversation history in chat playground.

🧰 Fixes and Improvements
- Fix the slow repository indexing due to constraint memory arena in tantivy index writer.
- Command line argument
--model
is now optional, so users can create a chat only instance. - New command line argument
--parallelism
to control the throughput and VRAM usage: #727 - New api path
/metrics
endpoint for prometheus metrics collection.
💫 New Contributors
- @liangfung made their first contribution in #702
- @erfanium made their first contribution in #742
- @costanzo made their first contribution in #748
- @darknight made their first contribution in #750
- @suside made their first contribution in #775
- @jpoisso made their first contribution in #838
- @Squadrick made their first contribution in #849
- @sonique6784 made their first contribution in #813
Full Changelog: v0.5.5...v0.6.0-rc.0