Weβre thrilled to announce the addition of three more outstanding serverless Inference Providers to the Hugging Face Hub: Fireworks AI, Hyperbolic, Nebius AI Studio, and Novita. These providers join our growing ecosystem, enhancing the breadth and capabilities of serverless inference directly on the Hubβs model pages. This release adds official support for these 3 providers, making it super easy to use a wide variety of models with your preferred providers.
See our announcement blog for more details: https://huggingface.co/blog/new-inference-providers.
- Add Fireworks AI provider + instructions for new provider by @Wauplin in #2848
- Add Hyperbolic provider by @hanouticelina in #2863
- Add Novita provider by @hanouticelina in #2865
- Nebius AI Studio provider added by @Aktsvigun in #2866
- Add Black Forest Labs provider by @hanouticelina in #2864
Note that Black Forest Labs is not yet supported on the Hub. Once we announce it, huggingface_hub 0.29.0
will automatically support it.
β‘ Other Inference updates
- Default to
base_url
if provided by @Wauplin in #2805 - update supported models by @hanouticelina in #2813
- [InferenceClient] Better handling of task parameters by @hanouticelina in #2812
- Add YuE (music gen) from fal.ai by @Wauplin in #2801
- [InferenceClient] Renaming
extra_parameters
toextra_body
by @hanouticelina in #2821 - fix automatic-speech-recognition output parsing by @hanouticelina in #2826
- [Bot] Update inference types by @HuggingFaceInfra in #2791
- Support inferenceProviderMapping as expand property by @Wauplin in #2841
- Handle extra fields in inference types by @Wauplin in #2839
- [InferenceClient] Add dynamic inference providers mapping by @hanouticelina in #2836
- (misc) Deprecate some hf-inference specific features (wait-for-model header, can't override model's task, get_model_status, list_deployed_models) by @Wauplin in #2851
- Partial revert #2851: allow task override on sentence-similarity by @Wauplin in #2861
- Fix Inference Client VCR tests by @hanouticelina in #2858
- update new provider doc by @hanouticelina in #2870
π Breaking changes
None.
π οΈ Small fixes and maintenance
π QoL improvements
- dev(narugo): add resume for ranged headers of http_get function by @narugo1992 in #2823
π Bug and typo fixes
- [Docs] Fix broken link in CLI guide documentation by @hanouticelina in #2799
- fix by @anael-l in #2806): Replace urljoin for HF_ENDPOINT paths
- InferenceClient some minor docstrings thingies by @julien-c in #2810
- Do not send staging token to production by @Wauplin in #2811
- Add
HF_DEBUG
environment variable for debugging/reproducibility by @Wauplin in #2819 - Fix curlify by @Wauplin in #2828
- Improve whoami() error messages by specifying token source by @aniketqw in #2814
- Fix error message if invalid token on file download by @Wauplin in #2847
- Fix test_dataset_info (missing dummy dataset) by @Wauplin in #2850
- Fix is_jsonable if integer key in dict by @Wauplin in #2857
ποΈ internal
- another test by @Wauplin (direct commit on main)
- feat(ci): ignore unverified trufflehog results by @Wauplin in #2837
- Add datasets and diffusers to prerelease tests by @Wauplin in #2834
- Always proxy hf-inference calls + update tests by @Wauplin in #2798
- Skip list_models(inference=...) tests in CI by @Wauplin in #2852
- Deterministic test_export_folder (dduf testsΓ by @Wauplin in #2854
- [cleanup] Unique constants in tests + env variable for inference tests by @Wauplin in #2855
- feat: Adds a new environment variable HF_HUB_USER_AGENT_ORIGIN to set origin of calls in user-agent by @Hugoch in #2869
Significant community contributions
The following contributors have made significant changes to the library over the last release:
- @narugo1992
- dev(narugo): add resume for ranged headers of http_get function (#2823)
- @Aktsvigun
- Nebius AI Studio provider added (#2866)