Bug Fixes & Improvements
- Autodiff: fixed RAM memory leak with correct graph cleanup (#3957 #3982) @laggui
- Better memory reuse: improved sliced memory pool implementation (#3941) @nathanielsimard
- Cuda: update
cudarc, auto-detect CUDA version and fix some 12.8 features (CubeCL #1008) @wingertge - Quantized Linear: fixed fusion configuration to fuse more precisions (#3941) @nathanielsimard
- PyTorch import: fixed pickle reader regression with integer dictionary keys (#3978) @laggui
- Docs: switched to
doc_cfgto fixdocs.rsbuilds (#3979) @laggui - Tensor API fixes: