Philippe Tillet
|
28c250216c
|
[dnn/gemm] added some bounds checking
|
2019-07-19 21:32:55 -07:00 |
|
Philippe Tillet
|
5215fb0424
|
[codegen] some more optimizations
|
2019-07-19 20:29:03 -07:00 |
|
Philippe Tillet
|
71594da66f
|
[dnn/gemm]: fixed leading dimension in transposed variants
|
2019-07-18 16:35:48 -07:00 |
|
Philippe Tillet
|
f0d8306437
|
[codegen/alignment_info] better handling of constants
|
2019-07-18 16:12:06 -07:00 |
|
Philippe Tillet
|
86f70f8224
|
[codegen/selection] performance fix-up when A is transposed for hmma
|
2019-07-17 21:46:23 -07:00 |
|
Philippe Tillet
|
2f0817b2cd
|
[codegen/selection] tensor cores now used for transposed layouts
|
2019-07-17 17:20:38 -07:00 |
|
Philippe Tillet
|
bfa39b8992
|
preparing the field for tensor cores transposes
|
2019-07-17 13:20:33 -07:00 |
|
Philippe Tillet
|
a55b098e88
|
[dnn/shift] now using constant divisions
|
2019-07-16 21:05:21 -07:00 |
|
Philippe Tillet
|
ec24e1e7df
|
trying to remove interior logic
|
2019-07-16 18:47:50 -07:00 |
|
Philippe Tillet
|
164d85077f
|
more stuff
|
2019-07-16 15:03:53 -07:00 |
|
Philippe Tillet
|
28959fe165
|
[runtime/jit] made auto-tuning silent
|
2019-07-16 14:41:38 -07:00 |
|
Philippe Tillet
|
7d1797cd32
|
ugh
|
2019-07-16 12:59:27 -07:00 |
|
Philippe Tillet
|
aa8bcf6bde
|
[dnn/shift] added split-k for shift-conv
|
2019-07-15 21:03:58 -07:00 |
|
Philippe Tillet
|
434f65737f
|
[runtime] put jit::launch_info in another file
|
2019-07-15 12:35:53 -07:00 |
|
Philippe Tillet
|
3c128fc2e2
|
[jit/autotune] added support for multi-threaded auto-tuning
|
2019-07-14 22:31:30 -07:00 |
|
Philippe Tillet
|
3e7a3ed67a
|
[dnn/shift]: added support for fp16
|
2019-07-13 21:05:34 -07:00 |
|
Philippe Tillet
|
fe42cb7142
|
[dnn/shift] optimizations for NCHW layout
|
2019-07-12 20:22:32 -07:00 |
|
Philippe Tillet
|
7512c7ebed
|
some cleaning
|
2019-07-12 20:03:05 -07:00 |
|
Philippe Tillet
|
b7986baffa
|
[dnn]: Now implementing all existing DNN routines using common base template and auto-tuner
|
2019-07-09 19:52:55 -07:00 |
|
Philippe Tillet
|
88675fa01a
|
[dnn] added base template class for mutualized auto-tuning
|
2019-07-09 16:09:34 -07:00 |
|
Philippe Tillet
|
1d88f0a36b
|
stuff
|
2019-07-03 19:25:16 -07:00 |
|
Philippe Tillet
|
8fc253946c
|
[codegen] shift: added sketch for shift-convolution backpropagation
|
2019-07-02 16:39:07 -07:00 |
|
Philippe Tillet
|
c172bd518b
|
more stuff
|
2019-06-30 16:55:02 -07:00 |
|
Philippe Tillet
|
9a86bc51e1
|
[language] added alignment metadata for variables
|
2019-06-29 13:58:46 -07:00 |
|
Philippe Tillet
|
d8c3d58593
|
more optimization
|
2019-06-28 20:22:52 -07:00 |
|
Philippe Tillet
|
ab1afbf082
|
more performance optimizations
|
2019-06-28 17:04:07 -07:00 |
|
Philippe Tillet
|
a567f3f8a8
|
more cleaning
|
2019-06-28 15:10:39 -07:00 |
|
Philippe Tillet
|
21fd0fd65e
|
fixup
|
2019-06-28 11:13:36 -07:00 |
|
Philippe Tillet
|
12e6036e5f
|
trying interior shift
|
2019-06-27 14:13:48 -07:00 |
|
Philippe Tillet
|
d8526669f5
|
fixup
|
2019-06-27 12:39:17 -07:00 |
|
Philippe Tillet
|
9028e40f1d
|
[dnn] added shift in the DNN libs
|
2019-06-27 11:37:19 -07:00 |
|
Philippe Tillet
|
6300ec5080
|
[examples] added conv2d op in tensorflow
|
2019-06-26 18:50:53 -07:00 |
|
Philippe Tillet
|
d945ce5e1b
|
Now showing valid parameter for NN
|
2019-06-25 19:18:43 -07:00 |
|
Philippe Tillet
|
06b5992509
|
[feature] added basic tensor core support
|
2019-06-11 10:24:49 -07:00 |
|
Philippe Tillet
|
f58c9a4d2b
|
[general] hmma baseline setup
|
2019-06-05 14:43:38 -07:00 |
|
Philippe Tillet
|
8102efc064
|
[triton/examples/cpp] removed common.hpp helper
|
2019-05-28 14:14:33 -04:00 |
|
Philippe Tillet
|
a9d078c06f
|
[triton/dnn/conv] merged optimizations branch
- Added forward/backward support for strided convolution
- Added support for bias
- Added support for reduction splitting
|
2019-05-28 14:04:53 -04:00 |
|
Philippe Tillet
|
3f3eb1c2a4
|
[dnn/conv] Added the option to have look-up table for filters for all operations
|
2019-05-22 19:03:33 -04:00 |
|
Philippe Tillet
|
e8f23bcade
|
[dnn/conv] Added bias and forward stride
|
2019-05-22 13:27:08 -04:00 |
|
Philippe Tillet
|
f33a1f3fe3
|
[examples/pytorch] Fixed issues in backward pass of conv
|
2019-05-19 01:31:08 -04:00 |
|
Philippe Tillet
|
b2b55c52c9
|
[triton/python/conv]: Added cache for compiled kernels
|
2019-05-18 11:51:49 -04:00 |
|
Philippe Tillet
|
34f8617709
|
[dnn/conv] fixed formatting of generated Triton-C code
|
2019-05-16 15:48:02 -04:00 |
|
Philippe Tillet
|
ece7beea3c
|
[dnn/conv]: now using look-up table for wgrad computation as well
|
2019-05-16 15:26:16 -04:00 |
|
Philippe Tillet
|
15a967c81e
|
[dnn/conv] minor cleaning
|
2019-05-15 11:32:47 -04:00 |
|
Philippe Tillet
|
be2ba03382
|
[dnn/conv] optimizations of backpropagation with look-up tables
|
2019-05-14 19:10:59 -04:00 |
|
Philippe Tillet
|
5941501f70
|
[dnn] added Triton-C derivative computations in conv
|
2019-05-13 18:04:11 -04:00 |
|
Philippe Tillet
|
f6fe9492e4
|
[dnn/conv] added triton-c code for wgrad
|
2019-05-11 18:09:23 -04:00 |
|
Philippe Tillet
|
fc4daf11dd
|
[examples/conv] now deferring shape computations to conv configuration
|
2019-05-08 13:58:25 -04:00 |
|
Philippe Tillet
|
54f888a270
|
[dnn/conv] some minor fixes
|
2019-05-08 10:09:30 -04:00 |
|
Philippe Tillet
|
615569287e
|
more cleaning of conv
|
2019-05-06 19:30:22 -04:00 |
|