Philippe Tillet
e7461a862b
[CODEGEN] Bugfix in Disassociate pass; Added fp32 atomic_add support
2021-07-27 12:38:48 -07:00
Philippe Tillet
bb2d98ce4b
[LANG] Added support for flattening
2021-07-27 12:38:48 -07:00
Philippe Tillet
13ff6472e0
[LANG] Fixed undefined behavior in replace_all_uses_with()
2021-07-27 12:38:48 -07:00
Philippe Tillet
ddd89e1b22
[GENERAL] Fixed some undefined behavior with GCC-9
2021-07-27 12:38:48 -07:00
Philippe Tillet
0516ea96d0
[CODEGEN] Fixed bug that caused missing recoalescing for some transpose
...
operations
2021-07-27 12:38:48 -07:00
Philippe Tillet
c73dee080c
[CODEGEN] Fixed bug for phi nodes with constant incoming value
2021-07-27 12:38:48 -07:00
Philippe Tillet
54805596f5
[CODEGEN][ANALYSIS] bugfix in alignment analysis
2021-07-27 12:38:48 -07:00
Philippe Tillet
677ccfb44e
[CORE][RUNTIME] Better error message on internal compilation error
2021-07-27 12:38:48 -07:00
Philippe Tillet
5943baa53f
[GENERAL] Error messages now no longer make terminal color green
2021-07-27 12:38:48 -07:00
Philippe Tillet
3304629de9
[CORE] Fixed several issues that arose in the development of the
...
torch-blocksparse package:
* Now using warp shuffle in reductions when possible
* Various bugfixes in layout inference
* Added INFINITY, exponential and select
* Better error messages for unimplemented constructs
2021-07-27 12:38:48 -07:00
Philippe Tillet
9cb3fd899a
[CORE][DRIVER] Now only using PTX6.4 if CUDA10.1+ is detected
2021-07-27 12:38:48 -07:00
Philippe Tillet
268894a5ce
[PYTHON] Merged blocksparse branch:
...
* Example for blocksparse matrix multiplication
* Simplified Triton kernel API
* Revived auto-tuning in einsum
2021-07-27 12:38:48 -07:00
Philippe Tillet
dfb844bf41
[GENERAL] Improved caching mechanism:
...
* Now computing hash in libtriton
* Now only compiling a single pytorch hook per function signature
2021-07-27 12:38:48 -07:00
Philippe Tillet
4181f9f2af
[CODEGEN][TRANSFORM][PEEPHOLE] Fixed bug in *1 multiplication
2021-07-27 12:38:48 -07:00
Philippe Tillet
3816f2f259
[PYTHON][EINSUM] Now handling reduction sizes that are not a multiple of
...
TK
2021-07-27 12:38:48 -07:00
Philippe Tillet
d22cf4f717
[TRITON][LANG] Added support for bitcast
2021-07-27 12:38:48 -07:00
Philippe Tillet
6d7cf35123
History prior to this date belonged to the now deprecated ISAAC project, and was deleted to save space
2021-07-27 12:38:38 -07:00