This patch release primarily introduces the Hunyuan DiT pipeline from the Tencent team.
Hunyuan DiT
Hunyuan DiT is a transformer-based diffusion pipeline, introduced in the Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding paper by the Tencent Hunyuan.
import torch
from diffusers import HunyuanDiTPipeline
pipe = HunyuanDiTPipeline.from_pretrained(
"Tencent-Hunyuan/HunyuanDiT-Diffusers", torch_dtype=torch.float16
)
pipe.to("cuda")
# You may also use English prompt as HunyuanDiT supports both English and Chinese
# prompt = "An astronaut riding a horse"
prompt = "一个宇航员在骑马"
image = pipe(prompt).images[0]
🧠 This pipeline has support for multi-linguality.
📜 Refer to the official docs here to learn more about it.
Thanks to @gnobitab, for contributing Hunyuan DiT in #8240.
All commits
- Release: v0.28.0 by @sayakpaul (direct commit on v0.28.1-patch)
- [Core] Introduce class variants for
Transformer2DModel
by @sayakpaul in #7647 - resolve comflicts by @toshas (direct commit on v0.28.1-patch)
- Tencent Hunyuan Team: add HunyuanDiT related updates by @gnobitab in #8240
- Tencent Hunyuan Team - Updated Doc for HunyuanDiT by @gnobitab in #8383
- [Transformer2DModel] Handle
norm_type
safely while remapping by @sayakpaul in #8370 - Release: v0.28.1 by @sayakpaul (direct commit on v0.28.1-patch)
Significant community contributions
The following contributors have made significant changes to the library over the last release: