triton/lib/driver
Philippe Tillet 15f8e8c3b7 [CODEGEN] Major performance improvements on A100 (#70)
Improved handling of asynchronous copy, scheduling, and synchronization for A100. Now achieving CUTLASS-like performance on large square dense matrix multiplication tasks.
2021-02-21 18:19:39 -05:00