Philippe Tillet
|
f36a646ffc
|
[dnn/shift-conv] added and tested NCHW layout
|
2019-07-11 21:00:33 -07:00 |
|
Philippe Tillet
|
fe8caf12f0
|
[dnn/conv]: skeleton for NCHW layout
|
2019-07-11 20:34:38 -07:00 |
|
Philippe Tillet
|
207e021973
|
[codegen/shift] substantial cleaning of triton-c shift-conv code
|
2019-07-11 20:11:23 -07:00 |
|
Philippe Tillet
|
75cf2df110
|
[dnn/shift] many bugfixes in strided shift-conv
|
2019-07-10 19:49:31 -07:00 |
|
Philippe Tillet
|
4ca83f1935
|
ugh bug in shift-conv striding
|
2019-07-10 17:00:22 -07:00 |
|
Philippe Tillet
|
f665c742f9
|
testing a simple shiftnet
|
2019-07-10 13:33:08 -07:00 |
|
Philippe Tillet
|
3b89bc8463
|
[examples/python/pytorch] added skeleton of wrapper for shift-conv and batch-norm
|
2019-07-09 21:54:37 -07:00 |
|
Philippe Tillet
|
63b249c1d6
|
[examples/python/pytorch] added batchnorm cpp extension
|
2019-07-09 20:59:04 -07:00 |
|
Philippe Tillet
|
b7986baffa
|
[dnn]: Now implementing all existing DNN routines using common base template and auto-tuner
|
2019-07-09 19:52:55 -07:00 |
|
Philippe Tillet
|
88675fa01a
|
[dnn] added base template class for mutualized auto-tuning
|
2019-07-09 16:09:34 -07:00 |
|
Philippe Tillet
|
066ae338f1
|
[dnn/shift]: added stride to shift
|
2019-07-09 14:08:51 -07:00 |
|
Philippe Tillet
|
cc41604784
|
[codegen/batchnorm] forward and backward now seemingly working
|
2019-07-09 13:03:16 -07:00 |
|
Philippe Tillet
|
f74dcb7e30
|
[dnn/batchnorm]: added some more code in Triton-C batchnorm implementations
|
2019-07-08 20:18:20 -07:00 |
|
Philippe Tillet
|
fa3270dcf2
|
[codegen/selection] bugfix in code generation for reduction instructions
|
2019-07-08 18:53:37 -07:00 |
|
Philippe Tillet
|
f9db0449b7
|
[dnn] Adding batchnorm
|
2019-07-08 18:44:37 -07:00 |
|
Philippe Tillet
|
b0cf3143c5
|
[dnn/shift] bugfix in wgrad
|
2019-07-06 11:27:49 -07:00 |
|
Philippe Tillet
|
3e49dbe6ab
|
[dnn/shift] fixed in leading dimensions for shift-conv operation
|
2019-07-05 17:17:22 -07:00 |
|
Philippe Tillet
|
c666f71fd6
|
fixed bug
|
2019-07-05 15:07:20 -07:00 |
|
Philippe Tillet
|
88ebdddf3d
|
makes more sense now
|
2019-07-03 20:45:03 -07:00 |
|
Philippe Tillet
|
bd1040510f
|
dx works but that makes no sense?
|
2019-07-03 20:24:52 -07:00 |
|
Philippe Tillet
|
1b2ceadf0d
|
weight gradient seem to work
|
2019-07-03 20:04:38 -07:00 |
|
Philippe Tillet
|
39aa22babb
|
more tinkering
|
2019-07-03 19:52:31 -07:00 |
|
Philippe Tillet
|
1d88f0a36b
|
stuff
|
2019-07-03 19:25:16 -07:00 |
|
Philippe Tillet
|
0d8faa5b1e
|
fixup
|
2019-07-02 21:38:10 -07:00 |
|
Philippe Tillet
|
5144dc3a6c
|
[examples/python] added framework code for shift-conv
|
2019-07-02 20:45:10 -07:00 |
|
Philippe Tillet
|
8fc253946c
|
[codegen] shift: added sketch for shift-convolution backpropagation
|
2019-07-02 16:39:07 -07:00 |
|
Philippe Tillet
|
6cfb575d29
|
[lang] fixup in cast type
|
2019-06-30 17:43:18 -07:00 |
|
Philippe Tillet
|
c172bd518b
|
more stuff
|
2019-06-30 16:55:02 -07:00 |
|
Philippe Tillet
|
9a86bc51e1
|
[language] added alignment metadata for variables
|
2019-06-29 13:58:46 -07:00 |
|
Philippe Tillet
|
d8c3d58593
|
more optimization
|
2019-06-28 20:22:52 -07:00 |
|
Philippe Tillet
|
83b753512c
|
prefetching with shift
|
2019-06-28 17:17:50 -07:00 |
|
Philippe Tillet
|
ab1afbf082
|
more performance optimizations
|
2019-06-28 17:04:07 -07:00 |
|
Philippe Tillet
|
a567f3f8a8
|
more cleaning
|
2019-06-28 15:10:39 -07:00 |
|
Philippe Tillet
|
21fd0fd65e
|
fixup
|
2019-06-28 11:13:36 -07:00 |
|
Philippe Tillet
|
f4dedb522c
|
fixup
|
2019-06-27 17:05:48 -07:00 |
|
Philippe Tillet
|
12e6036e5f
|
trying interior shift
|
2019-06-27 14:13:48 -07:00 |
|
Philippe Tillet
|
d8526669f5
|
fixup
|
2019-06-27 12:39:17 -07:00 |
|
Philippe Tillet
|
9028e40f1d
|
[dnn] added shift in the DNN libs
|
2019-06-27 11:37:19 -07:00 |
|
Philippe Tillet
|
6300ec5080
|
[examples] added conv2d op in tensorflow
|
2019-06-26 18:50:53 -07:00 |
|
Philippe Tillet
|
f1a8972267
|
[examples] added tensorflow dense convolution templates
|
2019-06-26 11:39:22 -07:00 |
|
Philippe Tillet
|
25e9a10917
|
changed auto-tuner parameter ranges
|
2019-06-25 19:27:49 -07:00 |
|
Philippe Tillet
|
d945ce5e1b
|
Now showing valid parameter for NN
|
2019-06-25 19:18:43 -07:00 |
|
Philippe Tillet
|
616f22c610
|
confirmed this is the fastest bounds checking
|
2019-06-25 16:35:43 -07:00 |
|
Philippe Tillet
|
64513fb407
|
[codegen] added fallback when tensor cores cannot be used
|
2019-06-25 15:49:58 -07:00 |
|
Philippe Tillet
|
62000738f0
|
[codegen] renamed axis_info -> alignment_info
|
2019-06-25 15:10:47 -07:00 |
|
Philippe Tillet
|
d52abc9379
|
[codegen] bugfix in alignment inference
|
2019-06-25 15:06:15 -07:00 |
|
Philippe Tillet
|
edc31cabb0
|
[codegen] rough template for axis_info pass
|
2019-06-24 18:57:32 -07:00 |
|
Philippe Tillet
|
72867d17d4
|
more cleaning
|
2019-06-24 12:37:13 -07:00 |
|
Philippe Tillet
|
f257884eb7
|
some cleaning
|
2019-06-24 09:31:34 -07:00 |
|
Philippe Tillet
|
67989e7d18
|
fixup
|
2019-06-13 20:03:28 -07:00 |
|