facebookresearch/xformers v0.0.19 on GitHub

[0.0.19] - 2023-04-28

Fixed performance regression with nvcc>11.6 (#712)
fMHA/cutlass: Fixed nan in the output when using a torch.Tensor with -inf prefixes as attn_bias (#722)
fMHA/cutlass: Fixed nan in the output when the sequence length is larger than 2 ** 15 (#719)
fMHA/cutlass: Significant performance improvements (up to 2x) for both the forward and backward passes
fMHA/cutlass: The kernels are now deterministic
fMHA/cutlass: Fixed backward pass correctness when using dropout (#724)
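
For readers unfamiliar with how an attn_bias with -inf prefixes can lead to nan (#722), the following is a minimal, library-free sketch of one common failure mode in single-pass (streaming) softmax. It is an illustration only and is not claimed to be the actual cause or fix in the cutlass kernel; the function names and the guard flag are hypothetical.

```python
import math

NEG_INF = float("-inf")


def softmax_reference(scores):
    """Two-pass stable softmax: subtract the global max, then normalize."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_streaming(scores, guard=True):
    """Single-pass softmax with a running max and running sum.

    Hypothetical illustration: with guard=False, a leading -inf score
    evaluates exp(-inf - (-inf)) = exp(nan), which poisons the running sum.
    Assumes at least one score is finite.
    """
    running_max = NEG_INF
    running_sum = 0.0
    for s in scores:
        new_max = max(running_max, s)
        if guard and new_max == NEG_INF:
            # Everything seen so far is masked out; contribute nothing.
            continue
        # Rescale the old sum to the new max, then add the current term.
        running_sum = running_sum * math.exp(running_max - new_max)
        running_sum += math.exp(s - new_max)
        running_max = new_max
    return [math.exp(s - running_max) / running_sum for s in scores]


# Additive bias whose first two key positions are masked with -inf.
logits = [0.5, 0.2, 1.0, 2.0]
bias = [NEG_INF, NEG_INF, 0.0, 0.0]
scores = [l + b for l, b in zip(logits, bias)]

print(softmax_reference(scores))               # masked positions get weight 0
print(softmax_streaming(scores, guard=True))   # matches the reference
print(softmax_streaming(scores, guard=False))  # every entry is nan
```

The guarded variant skips the update while the running max is still -inf, so masked prefix positions never enter the rescaling step.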