Commit Graph

39 Commits

Author SHA1 Message Date
Philippe Tillet
88ebdddf3d makes more sense now 2019-07-03 20:45:03 -07:00
Philippe Tillet
bd1040510f dx works but that makes no sense? 2019-07-03 20:24:52 -07:00
Philippe Tillet
1b2ceadf0d weight gradient seem to work 2019-07-03 20:04:38 -07:00
Philippe Tillet
39aa22babb more tinkering 2019-07-03 19:52:31 -07:00
Philippe Tillet
1d88f0a36b stuff 2019-07-03 19:25:16 -07:00
Philippe Tillet
0d8faa5b1e fixup 2019-07-02 21:38:10 -07:00
Philippe Tillet
5144dc3a6c [examples/python] added framework code for shift-conv 2019-07-02 20:45:10 -07:00
Philippe Tillet
6300ec5080 [examples] added conv2d op in tensorflow 2019-06-26 18:50:53 -07:00
Philippe Tillet
f1a8972267 [examples] added tensorflow dense convolution templates 2019-06-26 11:39:22 -07:00
Philippe Tillet
25e9a10917 changed auto-tuner parameter ranges 2019-06-25 19:27:49 -07:00
Philippe Tillet
d945ce5e1b Now showing valid parameter for NN 2019-06-25 19:18:43 -07:00
Philippe Tillet
616f22c610 confirmed this is the fastest bounds checking 2019-06-25 16:35:43 -07:00
Philippe Tillet
64513fb407 [codegen] added fallback when tensor cores cannot be used 2019-06-25 15:49:58 -07:00
Philippe Tillet
d52abc9379 [codegen] bugfix in alignment inference 2019-06-25 15:06:15 -07:00
Philippe Tillet
edc31cabb0 [codegen] rough template for axis_info pass 2019-06-24 18:57:32 -07:00
Philippe Tillet
72867d17d4 more cleaning 2019-06-24 12:37:13 -07:00
Philippe Tillet
f7dcea1187 Now doing double-buffering 2019-06-13 19:48:02 -07:00
Philippe Tillet
36e3667a9a removed shared conflicts for 8x32x4 and 32x8x4 configurations 2019-06-13 17:51:54 -07:00
Philippe Tillet
21a9b92c87 disabling interleaving 2019-06-13 17:16:00 -07:00
Philippe Tillet
d487cf31ce trying 128 bits loads 2019-06-12 21:07:01 -07:00
Philippe Tillet
a6b580ec05 interleaving fails with B 2019-06-12 19:46:43 -07:00
Philippe Tillet
1b5a742a88 [triton/codegen] added shared memory padding for HMMA arguments and vectorized loads 2019-06-11 19:51:08 -07:00
Philippe Tillet
cbd916994d [example/tensorflow] no longer hardcoding library dir 2019-06-11 11:06:02 -07:00
Philippe Tillet
06b5992509 [feature] added basic tensor core support 2019-06-11 10:24:49 -07:00
Philippe Tillet
d074a166e2 [feature] basic tensor core utilization works 2019-06-08 14:39:45 -07:00
Philippe Tillet
ec4c6aaaaa Added inline PTX for mma.sync 2019-06-07 19:39:33 -07:00
Philippe Tillet
6fce9f28ae added fragmented axis 2019-06-07 10:32:56 -07:00
Philippe Tillet
781b6d377d seleciton now segfault (expected 2019-06-06 20:34:56 -07:00
Philippe Tillet
6045209d5b Now find correct tuning configuration 2019-06-06 20:13:26 -07:00
Philippe Tillet
cdf5a0d011 [codegen/tune]: added fragmentation types 2019-06-06 16:48:32 -07:00
Philippe Tillet
f58c9a4d2b [general] hmma baseline setup 2019-06-05 14:43:38 -07:00
Philippe Tillet
49fcfd6fc7 [examples/tensorflow] fixed #include issue 2019-06-05 11:09:41 -07:00
Philippe Tillet
fd91368f98 [general] creation of dnn module for gemm/conv triton routines 2019-05-06 17:47:06 -04:00
Philippe Tillet
0d694445e6 [examples] added skeleton for pytorch wrapper 2019-05-03 14:30:06 -04:00
Philippe Tillet
70f49a56c1 [examples/python/tensorflow] better skeleton for blocksparse 2019-05-01 17:09:01 -04:00
Philippe Tillet
55866f1ef6 [examples/python/tensorflow] fixed ABI compatibility issue in JIT
destructor
2019-05-01 13:38:56 -04:00
Philippe Tillet
7b6efc0463 [examples/python/tensorflow] bugfix in tensorflow wrapper example 2019-04-30 21:04:30 -04:00
Philippe Tillet
d934d8fb40 [examples/python/tensorflow] improved matmul wrapper 2019-04-30 12:25:35 -04:00
Philippe Tillet
8e809a9536 [examples] added skeleton for tensorflow op 2019-04-30 10:50:54 -04:00