Commit Graph

21 Commits

Author SHA1 Message Date
Philippe Tillet
3e7a3ed67a [dnn/shift]: added support for fp16 2019-07-13 21:05:34 -07:00
Philippe Tillet
fe42cb7142 [dnn/shift] optimizations for NCHW layout 2019-07-12 20:22:32 -07:00
Philippe Tillet
7512c7ebed some cleaning 2019-07-12 20:03:05 -07:00
Philippe Tillet
b7986baffa [dnn]: Now implementing all existing DNN routines using common base template and auto-tuner 2019-07-09 19:52:55 -07:00
Philippe Tillet
88675fa01a [dnn] added base template class for mutualized auto-tuning 2019-07-09 16:09:34 -07:00
Philippe Tillet
1d88f0a36b stuff 2019-07-03 19:25:16 -07:00
Philippe Tillet
8fc253946c [codegen] shift: added sketch for shift-convolution backpropagation 2019-07-02 16:39:07 -07:00
Philippe Tillet
c172bd518b more stuff 2019-06-30 16:55:02 -07:00
Philippe Tillet
9a86bc51e1 [language] added alignment metadata for variables 2019-06-29 13:58:46 -07:00
Philippe Tillet
d8c3d58593 more optimization 2019-06-28 20:22:52 -07:00
Philippe Tillet
ab1afbf082 more performance optimizations 2019-06-28 17:04:07 -07:00
Philippe Tillet
a567f3f8a8 more cleaning 2019-06-28 15:10:39 -07:00
Philippe Tillet
21fd0fd65e fixup 2019-06-28 11:13:36 -07:00
Philippe Tillet
12e6036e5f trying interior shift 2019-06-27 14:13:48 -07:00
Philippe Tillet
d8526669f5 fixup 2019-06-27 12:39:17 -07:00
Philippe Tillet
9028e40f1d [dnn] added shift in the DNN libs 2019-06-27 11:37:19 -07:00
Philippe Tillet
8102efc064 [triton/examples/cpp] removed common.hpp helper 2019-05-28 14:14:33 -04:00
Philippe Tillet
a9d078c06f [triton/dnn/conv] merged optimizations branch
- Added forward/backward support for strided convolution
- Added support for bias
- Added support for reduction splitting
2019-05-28 14:04:53 -04:00
Philippe Tillet
b2b55c52c9 [triton/python/conv]: Added cache for compiled kernels 2019-05-18 11:51:49 -04:00
Philippe Tillet
4b77b764ba [triton-c] added support for while loops 2019-04-26 15:08:02 -04:00
Philippe Tillet
3413aad582 [general] major overhaul of triton-c/triton-ir/triton-jit:
- Added alloc const
- Added atomics
- Pruning tuning space
- Added example for dot/conv/shift
- Bugfixes
2019-04-25 16:18:15 -04:00