Highlights:
- Examples
- Fix euler.py example (#1999) (by Yadi Cao)
- Performance improvements
- Automatically use read-only data cache load (#1998) (by Yuanming Hu)
Full changelog:
- [metal] Add SNodeRep_pointer (#2000) (by Ye Kuang)
- [misc] Add a link to Chinese video tutorial on Bilibili (#2003) (by Yuanming Hu)
- [async] [bug] Fix a bug in fusion and add some debug info (#2002) (by xumingkuan)
- [Perf] [cuda] Automatically use read-only data cache load (#1998) (by Yuanming Hu)
- [Example] Fix euler.py example (#1999) (by Yadi Cao)
- [metal] Refactor sparse shader impl in prep for pointer SNode (#1994) (by Ye Kuang)
- [error] Improve kernel argument type mismatch error message (#1995) (by Yuanming Hu)