Philippe Tillet | c0bc7ed8b0 | [PYTHON] Added TRITON_DEBUG_MODE which reallocates input tensors outside of the pytorch memory pool to spot out-of-bounds accesses more easily | 2021-07-27 12:38:48 -07:00
Philippe Tillet | 8ab62803db | [PYTHON] Context switching logic moved to PyTorch | 2021-07-27 12:38:48 -07:00
Philippe Tillet | 4f08d87fed | [DRIVER] Simplified Driver API by substantially removing reliance on driver::context | 2021-07-27 12:38:48 -07:00
Philippe Tillet | a77c925dfd | [DRIVER] Improved performance of Host driver code | 2021-07-27 12:38:48 -07:00
Philippe Tillet | 8f8d36c7a4 | [GENERAL] Various bugfixes | 2021-07-27 12:38:48 -07:00
Philippe Tillet | 8f3ee53f24 | [PYTHON] Added option to show PTX source code in Python | 2021-07-27 12:38:48 -07:00
Philippe Tillet | 06abc8cb40 | [GENERAL] Fix compatibility issue with older Torch versions | 2021-07-27 12:38:48 -07:00
Philippe Tillet | 02a6e81b88 | [PYTHON] Cleaning C++ bindings | 2021-07-27 12:38:48 -07:00
Philippe Tillet | 840308ab5d | [CODEGEN] More work on the CPU backend | 2021-07-27 12:38:48 -07:00
Philippe Tillet | 7af9d812cf | [PYTHON] Added credits to Scott Gray for the idea used in launch.cc | 2021-07-27 12:38:48 -07:00
Philippe Tillet | acff1b5e05 | [RUNTIME] Lower-level interface for executing functions | 2021-07-27 12:38:48 -07:00