Added
- Faster non-commutative reductions in the GPU backends. Work by
Anders Holst and Christian Påbøl Jacobsen.
Fixed
-
Interpreter crash for certain complicated size expressions involving
internal bindings (#2053). -
Incorrect type checking of
let
binding with explicit size
quantification, where size appears in type of body (#2048). -
GPU code generation for non-commutative non-segmented reductions
with array operands (#2051). -
Histogram with non-vectorised reduction operators (#2056). (But it
is probably not a good idea to write such programs.) -
Futhark's LSP server should work better with Eglot.
-
Incorrect copy removal inside histograms could cause compiler error
(#2058). -
CUDA backend now correctly queries for available shared memory,
which affects performance (hopefully positively). -
futhark literate
now switches to the directory containing the
.fut
file before executing its contents. This fixes accessing
files through relative paths.