Releases: bentoml/BentoML
BentoML-1.0.0a3
This is a preview release for BentoML 1.0. Check out the quick start guide here: https://docs.bentoml.org/en/latest/quickstart.html and the documentation at http://docs.bentoml.org
BentoML-1.0.0a2
This is a preview release for BentoML 1.0. Check out the quick start guide here: https://docs.bentoml.org/en/latest/quickstart.html and the documentation at http://docs.bentoml.org
BentoML-0.13.1
Detailed Changelog: v0.13.0...v0.13.1
Overview
BentoML 0.13.1 is a minor release containing mostly bug fixes and internal changes.
Changelog
- feat: SLO - API server max latency (#1583)
- feat: Save OpenAPI Spec JSON in BentoML bundle (#1686)
- fix: BentoService loading user-provided env.yml file in runtime (#1695)
- fix: BentoArtifact initialize with parameter issue (#1696)
- fix: Use $BENTOML_PORT as Dockerfile default port (#1706)
- fix: Fix missing s3_endpoint_url (#1708)
- fix: Wrap request in sagemaker model_server (#1716)
- refactor: Add deprecation warnings for deployment CLI commands (#1718)
- refactor: Replace DI framework (#1697)
- ci: PaddlePaddle integration test (#1739)
BentoML-0.13.0
Detailed Changelog: v0.12.1...v0.13.0
Overview
BentoML 0.13.0 is here! It's a release packed with lots of new features and important bug fixes. We encourage all users to upgrade.
❤️ Contributors
Thanks to @aarnphm @andrewsi-z @larme @gregd33 @bojiang @ssheng @henrywu2019 @yubozhao @jack1902 @illy @sencenan @parano @soeque1 @elia-secchi @Shumpei-Kikuta @StevenReitsma @dsherry @AnvithaGadagi @joaquincabezas for the contributions!
📢 Breaking Changes
- Configuration revamp
  - The `bentoml config` CLI command has been fully deprecated in this release
  - A new config system was introduced for configuring the BentoML API server, Yatai, tracing, and more (#1543, #1595, #1615, #1667); see the sketch after this list
  - Documentation: https://docs.bentoml.org/en/latest/guides/configuration.html
  - Added a `--do-not-track` CLI option and environment variable (#1534)
- Deprecated the `--enable-microbatch` flag
  - Use the `@api(batch=True|False)` option to choose between a micro-batch-enabled API and a non-batch API; see the example after this list
  - For an API defined in batch mode that needs to serve online traffic without batching behavior, use `--mb-max-batch-size=1` instead
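As a rough illustration of the new config system, here is a minimal sketch. The file name and keys below are assumptions for illustration only; refer to the configuration guide linked above for the actual schema:

```yaml
# bentoml_configuration.yml (file name and keys are illustrative)
api_server:
  port: 5000
  workers: 2
```

```bash
# Assumed usage: point BentoML at the file via the BENTOML_CONFIG env var
BENTOML_CONFIG=bentoml_configuration.yml bentoml serve-gunicorn IrisClassifier:latest
```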
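And a sketch of the `batch` option on inference APIs (service and handler names are illustrative; the decorator style matches the route-customization example in the 0.12.0 notes below):

```python
import bentoml
from bentoml.adapters import DataframeInput, JsonInput

@bentoml.env(infer_pip_packages=True)
class ExampleService(bentoml.BentoService):

    @bentoml.api(input=DataframeInput(), batch=True)
    def predict_batch(self, df):
        # Receives one DataFrame aggregating concurrent requests;
        # must return one result per row.
        return [0] * df.shape[0]

    @bentoml.api(input=JsonInput(), batch=False)
    def predict_single(self, parsed_json):
        # Receives a single request payload; no micro-batching.
        return 0
```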
🎉 New Features
- GPU Support
  - GPU serving guide: https://docs.bentoml.org/en/latest/guides/gpu_serving.html
  - Added a docker base image optimized for GPU serving (#1653); a usage sketch follows below
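  For instance, one might containerize a service and run it with GPU access (a sketch; the image tag is illustrative and the `containerize` flag usage is an assumption, see the GPU serving guide):

  ```bash
  bentoml containerize IrisClassifier:latest -t iris-gpu:latest
  docker run --gpus all -p 5000:5000 iris-gpu:latest
  ```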
- Add support for EvalML (#1603)
- Add support for ONNX-MLIR models (#1545)
- Add full CORS support for the Bento API server (#1576)
- Monitoring with Prometheus guide
- Optimize BentoML import delay (#1608)
- Support upload/download for Yatai backed by local file system storage (#1586)
🐞 Bug Fixes and Other Changes
- Fix StringInput with batch=True API (#1581)
- Fix docs.json link in API server UI (#1633)
- Fix uploading to remote path (#1601)
- Fix label missing after uploading Bento to remote Yatai (#1598)
- Fix /metrics endpoints with serve-gunicorn (#1666)
- Upgrade conda to 4.9.2 in default docker base image (#1525)
- Internal:
BentoML-0.12.1
Detailed Changelog: v0.12.0...v0.12.1
PaddlePaddle Support
We are thrilled to announce that BentoML now fully supports the PaddlePaddle framework from Baidu. Users can easily serve their own models created with Paddle via Paddle Inference and serve pre-trained models from PaddleHub, which contains over 300 production-grade pre-trained models.
Tutorial notebooks for using BentoML with PaddlePaddle:
- Paddle Inference: https://github.com/bentoml/gallery/blob/master/paddlepaddle/LinearRegression/LinearRegression.ipynb
- PaddleHub: https://github.com/bentoml/gallery/blob/master/paddlehub/image-segmentation/image-segmentation.ipynb
See the announcement and release note from PaddleHub: https://github.com/PaddlePaddle/PaddleHub/releases/tag/v2.1.0
Thank you @cqvu @deehrlic for contributing this feature to BentoML.
Bug fixes
BentoML-0.12.0
Detailed Changelog: v0.11.0...v0.12.0
New Features
- Breaking Change: Default model worker count is set to one #1454
  - Use the `--workers` CLI argument to specify the number of workers for your deployment (see the example below)
  - For heavy production workloads, we recommend experimenting with different worker counts and benchmarking your BentoML API server on the target hardware, to better understand model server performance
- Breaking Change: The micro-batching layer (Marshal server) is now enabled by default #1498 (see the example below)
  - For inference APIs defined with `batch=True`, this enables micro-batching behavior when serving; users can disable it with the `--disable-microbatch` flag
  - For inference APIs with `batch=False`, API requests are now queued in Marshal and then forwarded to the model backend server
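  For example (a sketch; the service tag is illustrative):

  ```bash
  # Run the API server with 2 model workers
  bentoml serve-gunicorn IrisClassifier:latest --workers 2

  # Micro-batching is now on by default; opt out explicitly if needed
  bentoml serve-gunicorn IrisClassifier:latest --disable-microbatch
  ```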
- New: Use non-root user in BentoML's API server docker image
- New: API/CLI for bulk delete of BentoML bundles in Yatai #1313
- Easier dependency management for PyPI and conda
  - Support all pip install options via a user-provided `requirements.txt` file
  - Breaking Change: when the `requirements_txt_file` option is in use, other pip package options will be ignored
  - Added the `conda_override_channels` option for using an explicit conda channel for conda dependencies: https://docs.bentoml.org/en/latest/concepts.html#conda-packages
- Better support for pip install options and remote python dependencies #1421
  - Let BentoML do it for you:
    ```python
    @bentoml.env(infer_pip_packages=True)
    ```
  - Use the existing `pip_packages` API to specify a list of dependencies:
    ```python
    @bentoml.env(
        pip_packages=[
            'scikit-learn',
            'pandas @https://github.com/pypa/pip/archive/1.3.1.zip',
        ]
    )
    ```
  - Use a `requirements.txt` file to specify all dependencies:
    ```python
    @bentoml.env(requirements_txt_file='./requirements.txt')
    ```
    In the `./requirements.txt` file, all pip install options can be used:
    ```
    #
    # These requirements were autogenerated by pipenv
    # To regenerate from the project's Pipfile, run:
    #
    #   pipenv lock --requirements
    #
    -i https://pypi.org/simple
    scikit-learn==0.20.3
    aws-sam-cli==0.33.1
    psycopg2-binary
    azure-cli
    bentoml
    pandas @https://github.com/pypa/pip/archive/1.3.1.zip
    https://[username[:password]@]pypi.company.com/simple
    https://user:he%2F%2Fo@pypi.company.com
    git+https://myvcs.com/some_dependency@sometag#egg=SomeDependency
    ```
- API/CLI for bulk delete #1313
  CLI command for delete:
  ```bash
  # Delete all saved Bento with a specific name
  bentoml delete --name IrisClassifier
  bentoml delete --name IrisClassifier -y  # delete without confirming with the user
  bentoml delete --name IrisClassifier --yatai-url=yatai.mycompany.com  # delete in remote Yatai

  # Delete all saved Bento with specific labels
  bentoml delete --labels "env=dev"
  bentoml delete --labels "env=dev, user=foobar"
  bentoml delete --labels "key1=value1, key2!=value2, key3 In (value3, value3a), key4 DoesNotExist"

  # Delete multiple saved Bento by their name:version tag
  bentoml delete --tag "IrisClassifier:v1, MyService:v3, FooBar:20200103_Lkj81a"

  # Delete all
  bentoml delete --all
  ```
  Yatai Client Python API:
  ```python
  yc = get_yatai_client()  # local Yatai
  yc = get_yatai_client('remote.yatai.com:50051')  # remote Yatai

  yc.repository.delete(prune, labels, bento_tag, bento_name, bento_version, require_confirm)
  """
  Params:
      prune: boolean, set True to delete all Bento services
      labels: string, label selector to filter the Bento services to delete
      bento_tag: string, Bento tag
      bento_name: string
      bento_version: string
      require_confirm: boolean, require the user to confirm interactively in the CLI
  """
  ```
- Customize the route of an API endpoint #1334
  ```python
  @env(infer_pip_packages=True)
  @artifacts([...])
  class MyPredictionService(BentoService):

      @api(route="/my_url_route/foo/bar", batch=True, input=DataframeInput())
      def predict(self, df):
          # instead of "/predict", the URL for this API endpoint will be "/my_url_route/foo/bar"
          ...
  ```
BentoML-0.11.0
Detailed Changelog: v0.10.1...v0.11.0
New Features
Interactively start and stop Model API Server during development
A new API was introduced in 0.11.0 for users to start and test an API server while developing their BentoService class:
```python
import requests

service = MyPredictionService()
service.pack("model", model)

# Start an API model server in the background
service.start_dev_server(port=5000)

# Send a test request to the server (`review` and `headers` come from the
# surrounding example), or open the URL in a browser
requests.post('http://localhost:5000/predict', data=review, headers=headers)

# Stop the dev server
service.stop_dev_server()

# Modify code and repeat ♻️
```
Here's an example notebook showcasing this new feature.
More PyTorch ecosystem integrations
Logging is fully customizable now!
Users can now use a single YAML file to customize the logging behavior in BentoML, including the prediction logs and feedback logs.
https://docs.bentoml.org/en/latest/guides/logging.html
Two new configs were also introduced for quickly turning console logging and file logging on or off:
https://github.com/bentoml/BentoML/blob/v0.11.0/bentoml/configuration/default_bentoml.cfg#L29
```ini
[logging]
console_logging_enabled = true
file_logging_enabled = true
```
If you are not sure how this config works, here's a new guide on how BentoML's configuration works: https://docs.bentoml.org/en/latest/guides/configuration.html
More model management APIs
All model management CLI commands and Yatai client Python APIs now support the `yatai_url` parameter, making it easy to interact with a remote YataiService and centrally manage all your BentoML-packaged ML models:
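A minimal sketch (the remote address and tag are illustrative; the `yatai_url` usage mirrors the 0.10.0 examples below):

```python
from bentoml.yatai.client import get_yatai_client

yc = get_yatai_client()

# Any model management call can target a remote YataiService directly
bento = yc.repository.get(
    'IrisClassifier:20210101000000_ABCDEF',  # hypothetical name:version tag
    yatai_url='http://staging.yatai.mycompany.com:50050',
)
```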
Support bundling zipimport modules #1261
Bundling zip modules with BentoML is now possible with this newly added API:
```python
@bentoml.env(zipimport_archives=['nested_zipmodule.zip'])
@bentoml.artifacts([SklearnModelArtifact('model')])
class IrisClassifier(bentoml.BentoService):
    ...
```
BentoML also manages `sys.path` when loading a saved BentoService with zipimport archives, making sure the zip modules can be imported in user code.
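For instance (a sketch; the bundle path is hypothetical):

```python
import bentoml

# Loading the saved bundle puts 'nested_zipmodule.zip' on sys.path first,
# so code inside the service can `import nested_zipmodule` as usual.
svc = bentoml.load('./path/to/saved_bundle')
```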
Announcements
Monthly Community Meeting
Thank you again for everyone coming to the first community meeting this week! If you are not invited to the community meeting calendar yet, make sure to join it here: https://github.com/bentoml/BentoML/discussions/1396
Hiring
The BentoML team is hiring for multiple Software Engineer roles to help build the future of this open-source project and the business behind it. We are looking for people with experience in one of the following areas: ML infrastructure, backend systems, data engineering, SRE, full-stack, and technical writing. Feel free to pass along the message to anyone you know who might be interested; we'd really appreciate that!
BentoML-0.10.1
Bug Fix
This is a minor release containing one bug fix for issue #1318, where the docker build process for the BentoML API model server was broken due to an error in the init shell script. The issue has been fixed in #1319 and included in this new release.
Our integration tests did not catch this issue because, in the development and CI/test environments, we bundle the local "dirty" BentoML installation into the generated Dockerfile, whereas the production release of BentoML uses the version installed from PyPI. The issue in #1318 was an edge case that could be triggered only with the released version of BentoML and the published docker image. We are investigating ways to run all our integration tests against a preview release before making a final release, as part of our QA process, which should help prevent this type of bug from getting into final releases in the future.
BentoML-0.10.0
New Features & Improvements
```python
from bentoml.yatai.client import get_yatai_client

bento_service.save()  # Save and register the bento service locally

# Push the bento service to a remote Yatai service
yc = get_yatai_client('http://staging.yatai.mycompany.com:50050')
yc.repository.push(
    f'{bento_service.name}:{bento_service.version}',
)

# Pull a bento service from a remote Yatai server and register it locally
yc = get_yatai_client('http://staging.yatai.mycompany.com:50050')
yc.repository.pull(
    'bento_name:version',
)

# Delete in local Yatai
yatai_client = get_yatai_client()
yatai_client.repository.delete('name:version')

# Delete in batch by labels
yatai_client = get_yatai_client()
yatai_client.prune(labels='cicd=failed, framework In (sklearn, xgboost)')

# Get bento service metadata
yatai_client.repository.get('bento_name:version', yatai_url='http://staging.yatai.mycompany.com:50050')

# List bento services by label
yatai_client.repository.list(labels='label_key In (value1, value2), label_key2 Exists', yatai_url='http://staging.yatai.mycompany.com:50050')
```
New CLI commands for model management:
Push a local bento service to a remote yatai service:
```bash
$ bentoml push bento_service_name:version --yatai-url http://staging.yatai.mycompany.com:50050
```
Added the `--yatai-url` option to the following CLI commands, for interacting with a remote yatai service directly:
```
bentoml get
bentoml list
bentoml delete
bentoml retrieve
bentoml run
bentoml serve
bentoml serve-gunicorn
bentoml info
bentoml containerize
bentoml open-api-spec
```
- Model Metadata API #1179, shoutout to @jackyzha0 for designing and building this feature!
  Ability to save additional metadata for any artifact type, e.g.:
  ```python
  model_metadata = {
      'k1': 'v1',
      'job_id': 'ABC',
      'score': 0.84,
      'datasets': ['A', 'B'],
  }
  svc.pack("model", test_model, metadata=model_metadata)

  svc.save_to_dir(str(tmpdir))
  loaded_service = bentoml.load(str(tmpdir))
  print(loaded_service.artifacts.get('model').metadata)
  ```
- Improved TensorFlow support, by @bojiang
- Automated AWS EC2 deployment #1160, a massive 3800+ line PR by @mayurnewase
  - Create an auto-scaling endpoint on AWS EC2 with just one command, see the documentation here: https://docs.bentoml.org/en/latest/deployment/aws_ec2.html
- Enable input & output data capture in Sagemaker deployment #1189, by @j-hartshorn
- Faster docker image rebuild when only model artifacts are updated #1199
- Support URL location prefix in yatai-service gRPC/Web server #1063 #1184
- Support relative path for showing Swagger UI page in the model server #1207
- Add onnxruntime gpu as supported backend #1213
- Add label and artifact metadata display to yatai web ui #1249
- Make bentoml module executable #1274
  ```bash
  python -m bentoml <subcommand>
  bentoml serve-gunicorn --enable-microbatch --mb-max-latency 3333 --mb-max-batch-size 3333 IrisClassifier:20201202154246_C8DC0A
  ```
Bug fixes
- Allow deleting bento that was previously deleted with the same name and version #1211
- Construct docker API client from env #1233
- Pin-down SqlAlchemy version #1238
- Avoid potential TypeError in batching server #1252
- Fix inference API docstring override by default #1302
Documentation
- Add examples of queries with requests for adapters #1202
- Update import paths to reflect fastai2->fastai rename #1227
- Add model artifact metadata information to the core concept page #1259
- Update adapters.rst to include new input adapters #1269
- Update quickstart guide #1262
- Docs for gluon support #1271
- Fix CURL commands for posting files in input adapters doc string #1307
Internal, CI, and Tests
- Fix installing bundled pip dependencies in Azure and Sagemaker deployments #1214 (affects bentoml developers only)
- Add Integration test for Fasttext #1221
- Add integration test for spaCy #1236
- Add integration test for models using tf native API #1245
- Add tests for run_api_server_docker_container microbatch #1247
- Add integration test for LightGBM #1243
- Update Yatai web ui node dependencies version #1256
- Add integration test for bento management #1263
- Add yatai server integration tests to Github CI #1265
- Update e2e yatai service tests #1266
- Include additional information for EC2 test #1270
- Refactor CI for TensorFlow2 #1277
- Make tensorflow integration tests run faster #1278
- Fix overridden protobuf version in CI #1286
- Add integration test for tf1 #1285
- Refactor yatai service integration test #1290
- Refactor Saved Bundle Loader #1291
- Fix flaky yatai service integration tests #1298
- Refine KerasModelArtifact & its integration test #1295
- Improve API server integration tests #1299
- Add integration tests for ragged_tensor #1303
Announcements
- We have started using Github Projects feature to track roadmap items for BentoML, you can find it here: https://github.com/bentoml/BentoML/projects/1
- We are hiring senior engineers and a lead developer advocate to join our team, let us know if you or someone you know might be interested 👉 [email protected]
- Apologies for the long wait between the 0.9 and 0.10 releases; we are getting back to our bi-weekly release schedule now! We need help with documenting new features, writing release notes, and QA-ing new releases before they go out. Let us know if you'd be interested in helping out!
Thank you everyone for contributing to this release! @j-hartshorn @withsmilo @yubozhao @bojiang @changhw01 @mayurnewase @telescopic @jackyzha0 @pncnmnp @kishore-ganesh @rhbian @liusy182 @awalvie @cathy-kim @jsemric 🎉🎉🎉