What's Changed
- docs: Add GPU inference doc by @Sherlock113 in #4654
- chore: update quickstart by @ssheng in #4655
- docs: Add JSON output for bentovllm by @Sherlock113 in #4657
- chore: cleanup quickstart by @ssheng in #4658
- docs: Update help info by @Sherlock113 in #4664
- fix: remove the uvicorn server header by @frostming in #4665
- docs: Fix format by @Sherlock113 in #4666
- docs: Add model composition doc by @Sherlock113 in #4668
- docs: Update example project list by @Sherlock113 in #4673
- docs: Add the monitoring and data collection doc by @Sherlock113 in #4662
- docs: Add add_asgi_middleware doc by @Sherlock113 in #4672
- fix: delete useless enum and fix enum value by @FogDong in #4674
- docs: Add RAG tutorial by @Sherlock113 in #4675
- docs: Update the clients doc by @Sherlock113 in #4676
- docs: Add some explanations for bentoml.models.get by @Sherlock113 in #4660
- docs: Add e2e test doc by @Sherlock113 in #4679
- fix(cloud client): various type error by @bojiang in #4680
- fix(cli): bentoml cli verbosity not passed to the subprocess correctly by @frostming in #4661
Full Changelog: v1.2.11...v1.2.12