- More assembly optimizations for x86_64
- Optimizations in the plane/region logic
- Optimizations in the motion estimation code
- Optimizations in the `cdef_dist` code (AVX2 and general logic)
- Optimizations in the entropy coder
- Avoid undefined behavior in `cdf_context`
- Leverage const-generics to improve the CDF bookkeeping/rollback system
- Additional fixes