Highlights:
- Bug fixes
- Fix floordiv overflow when
lhs * rhs
is large (#970) (by 彭于斌)
- Fix floordiv overflow when
- Language and syntax
- Refactor SNode and support coordinate offsets (#945) (by Yuanming Hu)
- Metal backend
- Skip allocating Runtime::ListManager if no sparse SNode is used (#976) (by Ye Kuang)
Full changelog:
- [release] v0.6.4 (by Yuanming Hu)
- [doc] Minor updates in
hello.rst
(#963) (by Yuanming Hu) - [doc] Improve
hello.rst
(#972) (by Xiao Zhai) - [misc] Add a link to taichi_elements in README.md (#974) (by 彭于斌)
- [Bug] [opengl] [llvm] [metal] Fix floordiv overflow when
lhs * rhs
is large (#970) (by 彭于斌) - [test] Size Performance Regression Tests (SPRT) (#959) (by 彭于斌)
- [Metal] Skip allocating Runtime::ListManager if no sparse SNode is used (#976) (by Ye Kuang)
- [cuda] Improve kernel return value performance when unified memory is available (#965) (by Yuanming Hu)
- [doc] Update
syntax.rst
and related sections (#967) (by Yuanming Hu) - [opengl] add TI_WITH_OPENGL env var to disable OpenGL (#962) (by 彭于斌)
- [opt] [cuda] Fix constant folding in multithreaded CUDA applications (#954) (by Yuanming Hu)
- [ir][refactor] Avoid throwing exception in
replace_statements_with
(#968) (by xumingkuan) - [opengl] Use
compile_to_offloads
for IR lowering (#951) (by 彭于斌) - [Doc] fix typo in syntax_sugars.po (#960) (by Danni)
- [refactor][ir] Simplify ir.h to reduce compile-time (#934) (by 彭于斌)
- [lang] [refactor] deprecate @boardcast_if_scalar, all use @binary and @unary (#943) (by 彭于斌)
- [doc] Improve
hello.rst
(#955) (by Yuanming Hu) - [metal] Set labels on the command buffer/encoder (#950) (by Ye Kuang)
- [ir] Remove BasicStmtVisitor::current_struct_for (#952) (by xumingkuan)
- [misc] README.md use shields.io to show latest release (#947) (by 彭于斌)
- [Lang][IR][refactor] Refactor SNode and support coordinate offsets (#945) (by Yuanming Hu)