Highlights:
- OpenGL backend
Full changelog:
- [OpenGL] [perf] Support ti.block_dim as block size hint (#1602) (by 彭于斌)
- [opengl] [refactor] Fix TLS not working and refactor ParallelSize for grid-stride-loop (#1600) (by 彭于斌)
- [async] Demote struct-fors in async compilation (#1593) (by Ye Kuang)
- [ir] Make sure "StmtFieldManager" to be correct if we modify some fields after the ctor (#1587) (by Xuanda Yang)
- [ipython] [refactor] Misc tweaks to make #1308 easier to review (#1584) (by 彭于斌)
- [OpenGL] [perf] Support TLS to improve reduction performance (#1574) (by 彭于斌)