Philippe Tillet
|
49fcfd6fc7
|
[examples/tensorflow] fixed #include issue
|
2019-06-05 11:09:41 -07:00 |
|
Philippe Tillet
|
a9d078c06f
|
[triton/dnn/conv] merged optimizations branch
- Added forward/backward support for strided convolution
- Added support for bias
- Added support for reduction splitting
|
2019-05-28 14:04:53 -04:00 |
|
Philippe Tillet
|
e526ffc62b
|
[examples/pytorch] added a bunch of models for more thorough testing
|
2019-05-28 14:04:31 -04:00 |
|
Philippe Tillet
|
e8f23bcade
|
[dnn/conv] Added bias and forward stride
|
2019-05-22 13:27:08 -04:00 |
|
Philippe Tillet
|
f33a1f3fe3
|
[examples/pytorch] Fixed issues in backward pass of conv
|
2019-05-19 01:31:08 -04:00 |
|
Philippe Tillet
|
b2b55c52c9
|
[triton/python/conv]: Added cache for compiled kernels
|
2019-05-18 11:51:49 -04:00 |
|
Philippe Tillet
|
600aef72d5
|
[conv/dnn] now created a separate .h and .cpp file
|
2019-05-17 12:29:11 -04:00 |
|
Philippe Tillet
|
5941501f70
|
[dnn] added Triton-C derivative computations in conv
|
2019-05-13 18:04:11 -04:00 |
|
Philippe Tillet
|
fd91368f98
|
[general] creation of dnn module for gemm/conv triton routines
|
2019-05-06 17:47:06 -04:00 |
|
Philippe Tillet
|
f80441017c
|
[codegen] added leading dimension padding for transposition in shared
memory
|
2019-05-06 11:53:35 -04:00 |
|
Philippe Tillet
|
30833c18f1
|
[codegen/tune] bugfix in heuristics for nano-tile sizes
|
2019-05-04 01:32:34 -04:00 |
|
Philippe Tillet
|
0d694445e6
|
[examples] added skeleton for pytorch wrapper
|
2019-05-03 14:30:06 -04:00 |
|
Philippe Tillet
|
70f49a56c1
|
[examples/python/tensorflow] better skeleton for blocksparse
|
2019-05-01 17:09:01 -04:00 |
|
Philippe Tillet
|
55866f1ef6
|
[examples/python/tensorflow] fixed ABI compatibility issue in JIT
destructor
|
2019-05-01 13:38:56 -04:00 |
|
Philippe Tillet
|
7b6efc0463
|
[examples/python/tensorflow] bugfix in tensorflow wrapper example
|
2019-04-30 21:04:30 -04:00 |
|
Philippe Tillet
|
d934d8fb40
|
[examples/python/tensorflow] improved matmul wrapper
|
2019-04-30 12:25:35 -04:00 |
|
Philippe Tillet
|
8e809a9536
|
[examples] added skeleton for tensorflow op
|
2019-04-30 10:50:54 -04:00 |
|