Philippe Tillet
|
5995cbff8e
|
[CORE] Auto-tuning now copies scalar buffers. Still needs to copy all buffers that are both read from and written to.
|
2021-07-27 12:38:48 -07:00 |
|
Philippe Tillet
|
78cd54b0c8
|
[PYTHON] Added support for FP16 scalar kernel arguments
|
2021-07-27 12:38:48 -07:00 |
|
Philippe Tillet
|
609ef3a24d
|
[CORE] Fixed bug for Multi-GPU
|
2021-07-27 12:38:48 -07:00 |
|
Philippe Tillet
|
435acbf585
|
[PACKAGING] Added MANIFEST.in and some symlinks for better packaging
|
2021-07-27 12:38:48 -07:00 |
|
Philippe Tillet
|
f805ff278a
|
[PYTHON][SRC][BINDING] Improved code portability across compilers
|
2021-07-27 12:38:48 -07:00 |
|
Philippe Tillet
|
dfb844bf41
|
[GENERAL] Improved caching mechanism:
* Now computing hash in libtriton
* Now only compiling a single pytorch hook per function signature
|
2021-07-27 12:38:48 -07:00 |
|
Philippe Tillet
|
404dd18333
|
[PYTHON][CORE] Deprecating Tensorflow support
|
2021-07-27 12:38:48 -07:00 |
|
Philippe Tillet
|
6d7cf35123
|
History prior to this date belonged to the now deprecated ISAAC project, and was deleted to save space
|
2021-07-27 12:38:38 -07:00 |
|