Philippe Tillet | 5db3a7adfe | [python][examples] some more cleaning of dot product example | 2019-08-30 17:05:03 -07:00
Philippe Tillet | 7e0af2118c | [codegen] worked around bug seemingly from nvptx/ptxas by simplifying multiplications by 1: generated LLVM-IR looked correct; illegal addressing disappeared when running cuda-memcheck; illegal addressing disappeared when using nvptx-short-pointer | 2019-08-30 16:45:14 -07:00
Philippe Tillet | 141a823799 | [python] refactoring in anticipation of pytorch support | 2019-08-29 18:08:51 -07:00
Philippe Tillet | 7cb73f66e2 | testing some register gradient | 2019-08-26 19:25:58 -07:00
Philippe Tillet | 4075949f80 | [python] basic tensorflow wrapper working | 2019-08-26 16:53:49 -07:00
Philippe Tillet | 321d268a4a | more progress | 2019-08-25 21:26:09 -07:00
Philippe Tillet | 81571246cf | [general] fixed some warnings | 2019-08-18 14:08:57 -07:00
Philippe Tillet | b4a9ed9663 | [python] added basic tensorflow support | 2019-08-17 18:18:26 -07:00
Philippe Tillet | 078f0052fe | more cleaning | 2019-08-17 16:12:17 -07:00
Philippe Tillet | 11a6a92598 | [python][tensorflow] basic op generation is working | 2019-08-16 20:50:18 -07:00
Philippe Tillet | c7cb5f82ad | [general] removed LLVM #include's in all Triton headers | 2019-08-16 15:56:58 -07:00
Philippe Tillet | 4de22df930 | [python] added skeleton for python interface | 2019-08-15 20:50:10 -07:00