github bentoml/BentoML v1.0.2
BentoML - 1.0.2

2 years ago

🍱 We have just released BentoML v1.0.2 with a number of features and bug fixes requested by the community.

  • Added support for custom model versions, e.g. bentoml.tensorflow.save_model("model_name:1.2.4", model).
  • Fixed a PyTorch Runner payload serialization error raised when the tensor is not on the CPU:
      TypeError: can't convert cuda:0 device type tensor to numpy. Use Tensor.cpu() to copy the tensor to host memory first
  • Fixed Transformers GPU device assignment, which was broken by incorrect kwargs handling.
  • Fixed excessive Runner thread spawning issue under high load.
  • Fixed a PyTorch Runner inference error caused by saving a tensor while in inference mode:
      RuntimeError: Inference tensors cannot be saved for backward. To work around you can make a clone to get a normal tensor and use it in autograd.
  • Fixed Keras Runner error when the input has only a single element.
  • Deprecated the validate_json option in the JSON IO descriptor. We recommend specifying validation logic natively in the Pydantic model instead.
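
To illustrate the last item, here is a minimal sketch of validation rules declared directly on a Pydantic model. The model name IrisFeatures, its fields, and the helper validate_payload are hypothetical and chosen only for illustration. In a BentoML service, a model like this would typically be passed to the JSON IO descriptor through its pydantic_model argument; treat that wiring as an assumption to check against the documentation for your BentoML version.

```python
from pydantic import BaseModel, Field, ValidationError


class IrisFeatures(BaseModel):
    """Hypothetical request schema; the field names are illustrative only."""

    # The constraints live on the model itself, so no separate
    # validate_json switch is needed to enforce them.
    sepal_len: float = Field(..., gt=0)
    sepal_width: float = Field(..., gt=0)


def validate_payload(payload: dict) -> bool:
    """Return True if the payload satisfies the schema, False otherwise."""
    try:
        IrisFeatures(**payload)
    except ValidationError:
        return False
    return True


if __name__ == "__main__":
    print(validate_payload({"sepal_len": 5.1, "sepal_width": 3.5}))
    print(validate_payload({"sepal_len": -1.0, "sepal_width": 3.5}))
```

The first call passes validation; the second is rejected because sepal_len violates the gt=0 constraint. A payload with a missing field is rejected the same way.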

🎨 We added an examples directory containing sample projects that demonstrate various applications of BentoML. If you have a project idea you would like to share with the community, we welcome your contribution.

💡 We continue to update the documentation on every release to help our users unlock the full power of BentoML.

What's Changed

  • chore: remove all --pre from documentation by @aarnphm in #2738
  • chore(framework): onnx guide minor improvements by @larme in #2744
  • fix(framework): fix how pytorch DataContainer convert GPU tensor by @larme in #2739
  • doc: add missing variable by @robsonpeixoto in #2752
  • chore(deps): cattrs>=22.1.0 in setup.cfg by @sugatoray in #2758
  • fix(transformers): kwargs and migrate to framework tests by @ssheng in #2761
  • chore: add type hint for run and async_run by @aarnphm in #2760
  • docs: fix typo in SECURITY.md by @parano in #2766
  • chore: use pypa/build as PEP517 backend by @aarnphm in #2680
  • chore(e2e): capture log output by @aarnphm in #2767
  • chore: more robust prometheus directory ensuring by @bojiang in #2526
  • doc(framework): add scikit-learn section to ONNX documentation by @larme in #2764
  • chore: clean up dependencies by @sauyon in #2769
  • docs: misc docs reorganize and cleanups by @parano in #2768
  • fix(io descriptors): finish removing init_http_response by @sauyon in #2774
  • chore: fix typo by @aarnphm in #2776
  • feat(model): allow custom model versions by @sauyon in #2775
  • chore: add watchfiles as bentoml dependency by @aarnphm in #2777
  • doc(framework): keras guide by @larme in #2741
  • docs: Update service schema and validation by @ssheng in #2778
  • doc(frameworks): fix pip package syntax by @larme in #2782
  • fix(runner): thread limiter doesn't take effect by @bojiang in #2781
  • feat: add additional env var configuring num of threads in Runner by @parano in #2786
  • fix(templates): sharing variables at template level by @aarnphm in #2796
  • bug: fix JSON io_descriptor validate_json option by @parano in #2803
  • chore: improve error message when failed importing user service code by @parano in #2806
  • chore: automatic cache action version update and remove stale bot by @aarnphm in #2798
  • chore(deps): bump actions/checkout from 2 to 3 by @dependabot in #2810
  • chore(deps): bump codecov/codecov-action from 2 to 3 by @dependabot in #2811
  • chore(deps): bump github/codeql-action from 1 to 2 by @dependabot in #2813
  • chore(deps): bump actions/cache from 2 to 3 by @dependabot in #2812
  • chore(deps): bump actions/setup-python from 2 to 4 by @dependabot in #2814
  • fix(datacontainer): pytorch to_payload should disable gradient by @aarnphm in #2821
  • fix(framework): fix keras single input edge case by @larme in #2822
  • fix(framework): keras GPU handling by @larme in #2824
  • docs: update custom bentoserver guide by @parano in #2809
  • fix(runner): bind limiter to runner_ref instead by @bojiang in #2826
  • fix(pytorch): inference_mode context is thead local by @bojiang in #2828
  • fix: address multiple tags for containerize by @aarnphm in #2797
  • chore: Add gallery projects under examples by @ssheng in #2833
  • chore: running formatter on examples folder by @aarnphm in #2834
  • docs: update security auth middleware by @g0nz4rth in #2835
  • fix(io_descriptor): DataFrame columns check by @alizia in #2836
  • fix: examples directory structure by @ssheng in #2839
  • revert: "fix: address multiple tags for containerize (#2797)" by @ssheng in #2840

New Contributors

Full Changelog: v1.0.0...v1.0.1
