Philippe Tillet
|
3c128fc2e2
|
[jit/autotune] added support for multi-threaded auto-tuning
|
2019-07-14 22:31:30 -07:00 |
|
Philippe Tillet
|
3e7a3ed67a
|
[dnn/shift]: added support for fp16
|
2019-07-13 21:05:34 -07:00 |
|
Philippe Tillet
|
fe42cb7142
|
[dnn/shift] optimizations for NCHW layout
|
2019-07-12 20:22:32 -07:00 |
|
Philippe Tillet
|
7512c7ebed
|
some cleaning
|
2019-07-12 20:03:05 -07:00 |
|
Philippe Tillet
|
b7986baffa
|
[dnn]: Now implementing all existing DNN routines using a common base template and auto-tuner
|
2019-07-09 19:52:55 -07:00 |
|
Philippe Tillet
|
88675fa01a
|
[dnn] added base template class for shared auto-tuning
|
2019-07-09 16:09:34 -07:00 |
|
Philippe Tillet
|
1d88f0a36b
|
stuff
|
2019-07-03 19:25:16 -07:00 |
|
Philippe Tillet
|
8fc253946c
|
[codegen] shift: added sketch for shift-convolution backpropagation
|
2019-07-02 16:39:07 -07:00 |
|
Philippe Tillet
|
c172bd518b
|
more stuff
|
2019-06-30 16:55:02 -07:00 |
|
Philippe Tillet
|
9a86bc51e1
|
[language] added alignment metadata for variables
|
2019-06-29 13:58:46 -07:00 |
|
Philippe Tillet
|
d8c3d58593
|
more optimization
|
2019-06-28 20:22:52 -07:00 |
|
Philippe Tillet
|
ab1afbf082
|
more performance optimizations
|
2019-06-28 17:04:07 -07:00 |
|
Philippe Tillet
|
a567f3f8a8
|
more cleaning
|
2019-06-28 15:10:39 -07:00 |
|
Philippe Tillet
|
21fd0fd65e
|
fixup
|
2019-06-28 11:13:36 -07:00 |
|
Philippe Tillet
|
12e6036e5f
|
trying interior shift
|
2019-06-27 14:13:48 -07:00 |
|
Philippe Tillet
|
d8526669f5
|
fixup
|
2019-06-27 12:39:17 -07:00 |
|
Philippe Tillet
|
9028e40f1d
|
[dnn] added shift in the DNN libs
|
2019-06-27 11:37:19 -07:00 |
|
Philippe Tillet
|
8102efc064
|
[triton/examples/cpp] removed common.hpp helper
|
2019-05-28 14:14:33 -04:00 |
|
Philippe Tillet
|
a9d078c06f
|
[triton/dnn/conv] merged optimizations branch
- Added forward/backward support for strided convolution
- Added support for bias
- Added support for reduction splitting
|
2019-05-28 14:04:53 -04:00 |
|
Philippe Tillet
|
b2b55c52c9
|
[triton/python/conv]: Added cache for compiled kernels
|
2019-05-18 11:51:49 -04:00 |
|
Philippe Tillet
|
4b77b764ba
|
[triton-c] added support for while loops
|
2019-04-26 15:08:02 -04:00 |
|
Philippe Tillet
|
3413aad582
|
[general] major overhaul of triton-c/triton-ir/triton-jit:
- Added alloc const
- Added atomics
- Pruning tuning space
- Added example for dot/conv/shift
- Bugfixes
|
2019-04-25 16:18:15 -04:00 |
|