Commit Graph

3 Commits

Author SHA1 Message Date
Philippe Tillet
4a784ff13a [LANG] Now requiring tiles have power of 2 number of elements 2021-02-04 16:13:33 -05:00
Philippe Tillet
f278d9741a [GENERAL] Merged einsum feature branch. Various feature, performance
improvements and bugfixes:

* Added preliminary support for extended Einstein summation in PyTriton
* Significant performance improvement on FP32 kernels containing matrix
multiplication
* Added re-coalescing pass for FP16 kernels containing matrix
multiplication
* Various bugfixes
2020-01-20 12:42:48 -05:00
Philippe Tillet
f98b0b8e2a [general] deleted the old compiler frontend 2019-08-23 17:28:02 -07:00