Commit Graph

63 Commits

Author SHA1 Message Date
Philippe Tillet
28c250216c [dnn/gemm] added some bounds checking 2019-07-19 21:32:55 -07:00
Philippe Tillet
5215fb0424 [codegen] some more optimizations 2019-07-19 20:29:03 -07:00
Philippe Tillet
71594da66f [dnn/gemm]: fixed leading dimension in transposed variants 2019-07-18 16:35:48 -07:00
Philippe Tillet
f0d8306437 [codegen/alignment_info] better handling of constants 2019-07-18 16:12:06 -07:00
Philippe Tillet
86f70f8224 [codegen/selection] performance fix-up when A is transposed for hmma 2019-07-17 21:46:23 -07:00
Philippe Tillet
2f0817b2cd [codegen/selection] tensor cores now used for transposed layouts 2019-07-17 17:20:38 -07:00
Philippe Tillet
bfa39b8992 preparing the field for tensor cores transposes 2019-07-17 13:20:33 -07:00
Philippe Tillet
a55b098e88 [dnn/shift] now using constant divisions 2019-07-16 21:05:21 -07:00
Philippe Tillet
ec24e1e7df trying to remove interior logic 2019-07-16 18:47:50 -07:00
Philippe Tillet
164d85077f more stuff 2019-07-16 15:03:53 -07:00
Philippe Tillet
28959fe165 [runtime/jit] made auto-tuning silent 2019-07-16 14:41:38 -07:00
Philippe Tillet
7d1797cd32 ugh 2019-07-16 12:59:27 -07:00
Philippe Tillet
aa8bcf6bde [dnn/shift] added split-k for shift-conv 2019-07-15 21:03:58 -07:00
Philippe Tillet
434f65737f [runtime] put jit::launch_info in another file 2019-07-15 12:35:53 -07:00
Philippe Tillet
3c128fc2e2 [jit/autotune] added support for multi-threaded auto-tuning 2019-07-14 22:31:30 -07:00
Philippe Tillet
3e7a3ed67a [dnn/shift]: added support for fp16 2019-07-13 21:05:34 -07:00
Philippe Tillet
fe42cb7142 [dnn/shift] optimizations for NCHW layout 2019-07-12 20:22:32 -07:00
Philippe Tillet
7512c7ebed some cleaning 2019-07-12 20:03:05 -07:00
Philippe Tillet
b7986baffa [dnn]: Now implementing all existing DNN routines using common base template and auto-tuner 2019-07-09 19:52:55 -07:00
Philippe Tillet
88675fa01a [dnn] added base template class for mutualized auto-tuning 2019-07-09 16:09:34 -07:00
Philippe Tillet
1d88f0a36b stuff 2019-07-03 19:25:16 -07:00
Philippe Tillet
8fc253946c [codegen] shift: added sketch for shift-convolution backpropagation 2019-07-02 16:39:07 -07:00
Philippe Tillet
c172bd518b more stuff 2019-06-30 16:55:02 -07:00
Philippe Tillet
9a86bc51e1 [language] added alignment metadata for variables 2019-06-29 13:58:46 -07:00
Philippe Tillet
d8c3d58593 more optimization 2019-06-28 20:22:52 -07:00
Philippe Tillet
ab1afbf082 more performance optimizations 2019-06-28 17:04:07 -07:00
Philippe Tillet
a567f3f8a8 more cleaning 2019-06-28 15:10:39 -07:00
Philippe Tillet
21fd0fd65e fixup 2019-06-28 11:13:36 -07:00
Philippe Tillet
12e6036e5f trying interior shift 2019-06-27 14:13:48 -07:00
Philippe Tillet
d8526669f5 fixup 2019-06-27 12:39:17 -07:00
Philippe Tillet
9028e40f1d [dnn] added shift in the DNN libs 2019-06-27 11:37:19 -07:00
Philippe Tillet
6300ec5080 [examples] added conv2d op in tensorflow 2019-06-26 18:50:53 -07:00
Philippe Tillet
d945ce5e1b Now showing valid parameter for NN 2019-06-25 19:18:43 -07:00
Philippe Tillet
06b5992509 [feature] added basic tensor core support 2019-06-11 10:24:49 -07:00
Philippe Tillet
f58c9a4d2b [general] hmma baseline setup 2019-06-05 14:43:38 -07:00
Philippe Tillet
8102efc064 [triton/examples/cpp] removed common.hpp helper 2019-05-28 14:14:33 -04:00
Philippe Tillet
a9d078c06f [triton/dnn/conv] merged optimizations branch
- Added forward/backward support for strided convolution
- Added support for bias
- Added support for reduction splitting
2019-05-28 14:04:53 -04:00
Philippe Tillet
3f3eb1c2a4 [dnn/conv] Added the option to have look-up table for filters for all operations 2019-05-22 19:03:33 -04:00
Philippe Tillet
e8f23bcade [dnn/conv] Added bias and forward stride 2019-05-22 13:27:08 -04:00
Philippe Tillet
f33a1f3fe3 [examples/pytorch] Fixed issues in backward pass of conv 2019-05-19 01:31:08 -04:00
Philippe Tillet
b2b55c52c9 [triton/python/conv]: Added cache for compiled kernels 2019-05-18 11:51:49 -04:00
Philippe Tillet
34f8617709 [dnn/conv] fixed formatting of generated Triton-C code 2019-05-16 15:48:02 -04:00
Philippe Tillet
ece7beea3c [dnn/conv]: now using look-up table for wgrad computation as well 2019-05-16 15:26:16 -04:00
Philippe Tillet
15a967c81e [dnn/conv] minor cleaning 2019-05-15 11:32:47 -04:00
Philippe Tillet
be2ba03382 [dnn/conv] optimizations of backpropagation with look-up tables 2019-05-14 19:10:59 -04:00
Philippe Tillet
5941501f70 [dnn] added Triton-C derivative computations in conv 2019-05-13 18:04:11 -04:00
Philippe Tillet
f6fe9492e4 [dnn/conv] added triton-c code for wgrad 2019-05-11 18:09:23 -04:00
Philippe Tillet
fc4daf11dd [examples/conv] now deferring shape computations to conv configuration 2019-05-08 13:58:25 -04:00
Philippe Tillet
54f888a270 [dnn/conv] some minor fixes 2019-05-08 10:09:30 -04:00
Philippe Tillet
615569287e more cleaning of conv 2019-05-06 19:30:22 -04:00