Commit Graph

89 Commits

Author SHA1 Message Date
Philippe Tillet
b4a9ed9663 [python] added basic tensorflow support 2019-08-17 18:18:26 -07:00
Philippe Tillet
11a6a92598 [python][tensorflow] basic op generation is working 2019-08-16 20:50:18 -07:00
Philippe Tillet
c7cb5f82ad [general] removed LLVM #include's in all Triton headers 2019-08-16 15:56:58 -07:00
Philippe Tillet
3ece461ce2 added tensorflow code generator 2019-08-15 15:59:53 -07:00
Philippe Tillet
38a8b0ab19 [runtime] overall of the run-time API 2019-08-14 20:26:11 -07:00
Philippe Tillet
1400d960a6 [auto-tuning] much smaller parameters space 2019-08-12 21:15:21 -07:00
Philippe Tillet
fd49cdc92b [dnn][blocksparse] added dw code 2019-08-08 19:15:35 -07:00
Philippe Tillet
f93099bda1 [codegen][transform][trans] fixed incorrect replace_all_uses_with 2019-08-07 21:50:16 -07:00
Philippe Tillet
7578c27d3d [general][filesystem] added structure and namespace to code generation files 2019-08-07 21:17:17 -07:00
Philippe Tillet
392b55280d [codegen] some cleaning for batched matmul 2019-08-07 21:17:17 -07:00
Philippe Tillet
7b75b68edc dirty but working warp-splitting 2019-08-06 21:07:13 -07:00
Philippe Tillet
494bfa7671 didn't break correctness of existing HMMA 2019-08-06 17:34:00 -07:00
Philippe Tillet
46e9863ebe better fp16 support for dot 2019-08-06 17:19:13 -07:00
Philippe Tillet
0e201e18ff fixed simple FP16 test 2019-08-06 17:14:16 -07:00
Philippe Tillet
cf256a636c fixup 2019-08-06 16:44:16 -07:00
Philippe Tillet
5efdb7978e more improvements and regressions 2019-08-06 16:21:20 -07:00
Philippe Tillet
d62e581ab3 basic split-k across warps working for GEMM 2019-08-05 19:33:28 -07:00
Philippe Tillet
d9945692a9 [dnn] better specification of recompilation key 2019-08-02 17:42:48 -07:00
Philippe Tillet
17cb2db356 [dnn/blocksparse/dot] prototype version seems to pass basic test 2019-07-27 21:21:36 -07:00
Philippe Tillet
2a377bc8b1 [ir] deleted mask/merge instructions; will be replaced by masked_load/store and select 2019-07-25 15:06:15 -07:00
Philippe Tillet
38b3771c26 some reassociation 2019-07-23 14:43:18 -07:00
Philippe Tillet
c448876178 better benchmarking 2019-07-22 19:26:12 -07:00
Philippe Tillet
ead368d1ed [general] a bunch of fixes in anticipation of proper triton vs cudnn
benchmarks

* DNN: Added partial auto-tuning mode and skeleton for heuristics
* Examples: Moduralized benchmarking and now evaluating ResNet-18 shapes
2019-07-21 20:17:56 -07:00
Philippe Tillet
b1d81a5802 more work on heuristics 2019-07-21 18:11:54 -07:00
Philippe Tillet
484e3871cf [dnn/shift] added base pointer for a, b 2019-07-20 23:00:27 -07:00
Philippe Tillet
d159455f7b [codegen/alignment_info] better alignment information 2019-07-20 21:44:18 -07:00
Philippe Tillet
28c250216c [dnn/gemm] added some bounds checking 2019-07-19 21:32:55 -07:00
Philippe Tillet
5215fb0424 [codegen] some more optimizations 2019-07-19 20:29:03 -07:00
Philippe Tillet
71594da66f [dnn/gemm]: fixed leading dimension in transposed variants 2019-07-18 16:35:48 -07:00
Philippe Tillet
f0d8306437 [codegen/alignment_info] better handling of constants 2019-07-18 16:12:06 -07:00
Philippe Tillet
86f70f8224 [codegen/selection] performance fix-up when A is transposed for hmma 2019-07-17 21:46:23 -07:00
Philippe Tillet
2f0817b2cd [codegen/selection] tensor cores now used for transposed layotus 2019-07-17 17:20:38 -07:00
Philippe Tillet
bfa39b8992 preparing the field for tensor cores transposes 2019-07-17 13:20:33 -07:00
Philippe Tillet
a55b098e88 [dnn/shift] now using constant divisions 2019-07-16 21:05:21 -07:00
Philippe Tillet
ec24e1e7df trying to remove interior logic 2019-07-16 18:47:50 -07:00
Philippe Tillet
164d85077f more stuff 2019-07-16 15:03:53 -07:00
Philippe Tillet
28959fe165 [runtime/jit] made auto-tuning silent 2019-07-16 14:41:38 -07:00
Philippe Tillet
7d1797cd32 ugh 2019-07-16 12:59:27 -07:00
Philippe Tillet
aa8bcf6bde [dnn/shift] added split-k for shift-conv 2019-07-15 21:03:58 -07:00
Philippe Tillet
434f65737f [runtime] put jit::launch_info in another file 2019-07-15 12:35:53 -07:00
Philippe Tillet
3c128fc2e2 [jit/autotune] added support for multi-threaded auto-tuning 2019-07-14 22:31:30 -07:00
Philippe Tillet
3e7a3ed67a [dnn/shift]: added support for fp16 2019-07-13 21:05:34 -07:00
Philippe Tillet
fe42cb7142 [dnn/shift] optimizations for NCHW layout 2019-07-12 20:22:32 -07:00
Philippe Tillet
7512c7ebed some cleaning 2019-07-12 20:03:05 -07:00
Philippe Tillet
b7986baffa [dnn]: Now implementing all existing DNN routines using common base template and auto-tuner 2019-07-09 19:52:55 -07:00
Philippe Tillet
88675fa01a [dnn] added base template class for mutualized auto-tuning 2019-07-09 16:09:34 -07:00
Philippe Tillet
1d88f0a36b stuff 2019-07-03 19:25:16 -07:00
Philippe Tillet
8fc253946c [codegen] shift: added sketch for shift-convolution backpropagation 2019-07-02 16:39:07 -07:00
Philippe Tillet
c172bd518b more stuff 2019-06-30 16:55:02 -07:00
Philippe Tillet
9a86bc51e1 [language] added alignment metadata for variables 2019-06-29 13:58:46 -07:00