Philippe Tillet
|
61f25f90eb
|
basic parsing doesn't throw error
|
2019-08-20 16:22:43 -07:00 |
|
Philippe Tillet
|
bc11e31419
|
[lang] more progress on parser
|
2019-08-19 20:56:39 -07:00 |
|
Philippe Tillet
|
81571246cf
|
[general] fixed some warnings
|
2019-08-18 14:08:57 -07:00 |
|
Philippe Tillet
|
b4a9ed9663
|
[python] added basic tensorflow support
|
2019-08-17 18:18:26 -07:00 |
|
Philippe Tillet
|
11a6a92598
|
[python][tensorflow] basic op generation is working
|
2019-08-16 20:50:18 -07:00 |
|
Philippe Tillet
|
c7cb5f82ad
|
[general] removed LLVM #include's in all Triton headers
|
2019-08-16 15:56:58 -07:00 |
|
Philippe Tillet
|
3ece461ce2
|
added tensorflow code generator
|
2019-08-15 15:59:53 -07:00 |
|
Philippe Tillet
|
38a8b0ab19
|
[runtime] overall of the run-time API
|
2019-08-14 20:26:11 -07:00 |
|
Philippe Tillet
|
1400d960a6
|
[auto-tuning] much smaller parameters space
|
2019-08-12 21:15:21 -07:00 |
|
Philippe Tillet
|
fd49cdc92b
|
[dnn][blocksparse] added dw code
|
2019-08-08 19:15:35 -07:00 |
|
Philippe Tillet
|
f93099bda1
|
[codegen][transform][trans] fixed incorrect replace_all_uses_with
|
2019-08-07 21:50:16 -07:00 |
|
Philippe Tillet
|
7578c27d3d
|
[general][filesystem] added structure and namespace to code generation files
|
2019-08-07 21:17:17 -07:00 |
|
Philippe Tillet
|
392b55280d
|
[codegen] some cleaning for batched matmul
|
2019-08-07 21:17:17 -07:00 |
|
Philippe Tillet
|
7b75b68edc
|
dirty but working warp-splitting
|
2019-08-06 21:07:13 -07:00 |
|
Philippe Tillet
|
494bfa7671
|
didn't break correctness of existing HMMA
|
2019-08-06 17:34:00 -07:00 |
|
Philippe Tillet
|
46e9863ebe
|
better fp16 support for dot
|
2019-08-06 17:19:13 -07:00 |
|
Philippe Tillet
|
0e201e18ff
|
fixed simple FP16 test
|
2019-08-06 17:14:16 -07:00 |
|
Philippe Tillet
|
cf256a636c
|
fixup
|
2019-08-06 16:44:16 -07:00 |
|
Philippe Tillet
|
5efdb7978e
|
more improvements and regressions
|
2019-08-06 16:21:20 -07:00 |
|
Philippe Tillet
|
d62e581ab3
|
basic split-k across warps working for GEMM
|
2019-08-05 19:33:28 -07:00 |
|
Philippe Tillet
|
d9945692a9
|
[dnn] better specification of recompilation key
|
2019-08-02 17:42:48 -07:00 |
|
Philippe Tillet
|
17cb2db356
|
[dnn/blocksparse/dot] prototype version seems to pass basic test
|
2019-07-27 21:21:36 -07:00 |
|
Philippe Tillet
|
2a377bc8b1
|
[ir] deleted mask/merge instructions; will be replaced by masked_load/store and select
|
2019-07-25 15:06:15 -07:00 |
|
Philippe Tillet
|
38b3771c26
|
some reassociation
|
2019-07-23 14:43:18 -07:00 |
|
Philippe Tillet
|
c448876178
|
better benchmarking
|
2019-07-22 19:26:12 -07:00 |
|
Philippe Tillet
|
ead368d1ed
|
[general] a bunch of fixes in anticipation of proper triton vs cudnn
benchmarks
* DNN: Added partial auto-tuning mode and skeleton for heuristics
* Examples: Moduralized benchmarking and now evaluating ResNet-18 shapes
|
2019-07-21 20:17:56 -07:00 |
|
Philippe Tillet
|
b1d81a5802
|
more work on heuristics
|
2019-07-21 18:11:54 -07:00 |
|
Philippe Tillet
|
484e3871cf
|
[dnn/shift] added base pointer for a, b
|
2019-07-20 23:00:27 -07:00 |
|
Philippe Tillet
|
d159455f7b
|
[codegen/alignment_info] better alignment information
|
2019-07-20 21:44:18 -07:00 |
|
Philippe Tillet
|
28c250216c
|
[dnn/gemm] added some bounds checking
|
2019-07-19 21:32:55 -07:00 |
|
Philippe Tillet
|
5215fb0424
|
[codegen] some more optimizations
|
2019-07-19 20:29:03 -07:00 |
|
Philippe Tillet
|
71594da66f
|
[dnn/gemm]: fixed leading dimension in transposed variants
|
2019-07-18 16:35:48 -07:00 |
|
Philippe Tillet
|
f0d8306437
|
[codegen/alignment_info] better handling of constants
|
2019-07-18 16:12:06 -07:00 |
|
Philippe Tillet
|
86f70f8224
|
[codegen/selection] performance fix-up when A is transposed for hmma
|
2019-07-17 21:46:23 -07:00 |
|
Philippe Tillet
|
2f0817b2cd
|
[codegen/selection] tensor cores now used for transposed layotus
|
2019-07-17 17:20:38 -07:00 |
|
Philippe Tillet
|
bfa39b8992
|
preparing the field for tensor cores transposes
|
2019-07-17 13:20:33 -07:00 |
|
Philippe Tillet
|
164d85077f
|
more stuff
|
2019-07-16 15:03:53 -07:00 |
|
Philippe Tillet
|
28959fe165
|
[runtime/jit] made auto-tuning silent
|
2019-07-16 14:41:38 -07:00 |
|
Philippe Tillet
|
7d1797cd32
|
ugh
|
2019-07-16 12:59:27 -07:00 |
|
Philippe Tillet
|
7512c7ebed
|
some cleaning
|
2019-07-12 20:03:05 -07:00 |
|
Philippe Tillet
|
b7986baffa
|
[dnn]: Now implementing all existing DNN routines using common base template and auto-tuner
|
2019-07-09 19:52:55 -07:00 |
|
Philippe Tillet
|
8fc253946c
|
[codegen] shift: added sketch for shift-convolution backpropagation
|
2019-07-02 16:39:07 -07:00 |
|
Philippe Tillet
|
9a86bc51e1
|
[language] added alignment metadata for variables
|
2019-06-29 13:58:46 -07:00 |
|
Philippe Tillet
|
d945ce5e1b
|
Now showing valid parameter for NN
|
2019-06-25 19:18:43 -07:00 |
|
Philippe Tillet
|
06b5992509
|
[feature] added basic tensor core support
|
2019-06-11 10:24:49 -07:00 |
|
Philippe Tillet
|
f58c9a4d2b
|
[general] hmma baseline setup
|
2019-06-05 14:43:38 -07:00 |
|
Philippe Tillet
|
8102efc064
|
[triton/examples/cpp] removed common.hpp helper
|
2019-05-28 14:14:33 -04:00 |
|
Philippe Tillet
|
a9d078c06f
|
[triton/dnn/conv] merged optimizations branch
- Added forward/backward support for strided convolution
- Added support for bias
- Added support for reduction splitting
|
2019-05-28 14:04:53 -04:00 |
|
Philippe Tillet
|
b2b55c52c9
|
[triton/python/conv]: Added cache for compiled kernels
|
2019-05-18 11:51:49 -04:00 |
|
Philippe Tillet
|
ece7beea3c
|
[dnn/conv]: now using look-up table for wgrad computation as well
|
2019-05-16 15:26:16 -04:00 |
|