Commit Graph

21 Commits

Author SHA1 Message Date
Philippe Tillet
f36a646ffc [dnn/shift-conv] added and tested NCHW layout 2019-07-11 21:00:33 -07:00
Philippe Tillet
207e021973 [codegen/shift] substantial cleaning of triton-c shift-conv code 2019-07-11 20:11:23 -07:00
Philippe Tillet
75cf2df110 [dnn/shift] many bugfixes in strided shift-conv 2019-07-10 19:49:31 -07:00
Philippe Tillet
4ca83f1935 ugh bug in shift-conv striding 2019-07-10 17:00:22 -07:00
Philippe Tillet
f665c742f9 testing a simple shiftnet 2019-07-10 13:33:08 -07:00
Philippe Tillet
3b89bc8463 [examples/python/pytorch] added skeleton of wrapper for shift-conv and batch-norm 2019-07-09 21:54:37 -07:00
Philippe Tillet
63b249c1d6 [examples/python/pytorch] added batchnorm cpp extension 2019-07-09 20:59:04 -07:00
Philippe Tillet
b7986baffa [dnn]: Now implementing all existing DNN routines using common base template and auto-tuner 2019-07-09 19:52:55 -07:00
Philippe Tillet
88675fa01a [dnn] added base template class for mutualized auto-tuning 2019-07-09 16:09:34 -07:00
Philippe Tillet
f9db0449b7 [dnn] Adding batchnorm 2019-07-08 18:44:37 -07:00
Philippe Tillet
a9d078c06f [triton/dnn/conv] merged optimizations branch
- Added forward/backward support for strided convolution
- Added support for bias
- Added support for reduction splitting
2019-05-28 14:04:53 -04:00
Philippe Tillet
e526ffc62b [examples/pytorch] added a bunch of models for more thorough testing 2019-05-28 14:04:31 -04:00
Philippe Tillet
e8f23bcade [dnn/conv] Added bias and forward stride 2019-05-22 13:27:08 -04:00
Philippe Tillet
f33a1f3fe3 [examples/pytorch] Fixed issues in backward pass of conv 2019-05-19 01:31:08 -04:00
Philippe Tillet
b2b55c52c9 [triton/python/conv]: Added cache for compiled kernels 2019-05-18 11:51:49 -04:00
Philippe Tillet
600aef72d5 [conv/dnn] now created a separate .h and .cpp file 2019-05-17 12:29:11 -04:00
Philippe Tillet
5941501f70 [dnn] added Triton-C derivative computations in conv 2019-05-13 18:04:11 -04:00
Philippe Tillet
fd91368f98 [general] creation of dnn module for gemm/conv triton routines 2019-05-06 17:47:06 -04:00
Philippe Tillet
f80441017c [codegen] added leading dimension padding for transposition in shared
memory
2019-05-06 11:53:35 -04:00
Philippe Tillet
30833c18f1 [codegen/tune] bugfix in heuristics for nano-tile sizes 2019-05-04 01:32:34 -04:00
Philippe Tillet
0d694445e6 [examples] added skeleton for pytorch wrapper 2019-05-03 14:30:06 -04:00