Notable changes:
- (Mar 28, 2020) v0.5.9 released
- CPU & CUDA backends
- Support `bitmasked` as the leaf block structure for `1x1x1` masks (#676) (by Yuanming Hu)
- Documentation
- Updated contributor guideline (#658) (by Yuanming Hu)
- Infrastructure
- 6x faster compilation on CPU/CUDA backends (#673) (by Yuanming Hu)
- Language and syntax
- Metal backend
- Optimization
- CPU & CUDA backends
Full changelog:
- [misc] `misc/make_changelog.py` for automatically generating changelogs (#679) (by Yuanming Hu)
- [metal] Simplify Metal backend's namings (#675) (by Ye Kuang)
- [CPU][CUDA] Support `bitmasked` as the leaf block structure for `1x1x1` masks (#676) (by Yuanming Hu)
- [Infra] 6x faster compilation on CPU backends (#673) (by Yuanming Hu)
- [misc] Improve format server stability (#672) (by Yuanming Hu)
- [ir] Basic function definition/call instructions (#612) (by 彭于斌)
- [Lang] Simplify dense.bitmasked to bitmasked (#670) (by Ye Kuang)
- [misc] Fixed format server file coverage (#669) (by Yuanming Hu)
- [Opt] Merge adjacent if's with identical conditions (#668) (by xumingkuan)
- [metal] Move platform/metal to backends/metal (#667) (by Ye Kuang)
- [ir] Added irpass::gather_statements (#665) (by Yuanming Hu)
- [Opt] Dive into container statements to find local loads/stores for optimization, and optimize loads of new allocas to 0 (#662) (by xumingkuan)
- [Metal] Changes to enable `bitmasked` on Metal! (#661) (by Ye Kuang)
- [Doc] Updated contributor guideline (#658) (by Yuanming Hu)
- [misc] Introduced a temporary boolean constant for benchmarking advanced optimizations (#657) (by xumingkuan)
- [misc] v0.5.8 README (#654) (by Yuanming Hu)
- Fixed MGPCG (#652) (by Yuanming Hu)
- Refactor ASTTransformer.visit_For and fix a bug on grouped ndrange loops (#648) (by xumingkuan)
- [Metal] Silence compile warning with [[maybe_unused]] (#650) (by Ye Kuang)
- add LineAppender for OpenGL too #643 (#651) (by 彭于斌)
- Support break in non-parallel for statements by translating range-for into while #578 (#583) (by 彭于斌)
- [Metal] Add bitmasked support in MetalRuntime (#638) (by Ye Kuang)