Commit Graph

639 Commits

Author SHA1 Message Date
Philippe Tillet
b162c44d59 .
y
2023-01-09 22:58:40 -08:00
Phil Tillet
2fa0dfbce9 . 2023-01-09 22:50:38 -08:00
Phil Tillet
993bc17311 cleanup 2023-01-09 22:35:00 -08:00
Philippe Tillet
ff04a5e9b6 . 2023-01-09 22:11:00 -08:00
Phil Tillet
d88353a5a4 . 2023-01-09 20:14:06 -08:00
Phil Tillet
bae4c40379 reorder conversions to dot operand 2023-01-09 20:13:30 -08:00
Phil Tillet
c98c889d7f . 2023-01-09 19:08:51 -08:00
Phil Tillet
fc1007278d . 2023-01-09 18:45:44 -08:00
Phil Tillet
0c101e0c33 . 2023-01-09 16:30:28 -08:00
Phil Tillet
3fefcd78d4 . 2023-01-09 16:29:45 -08:00
Phil Tillet
137e866bd2 more work 2023-01-09 16:20:10 -08:00
Phil Tillet
8ebb593bbb more work 2023-01-09 15:45:06 -08:00
Phil Tillet
6c750b6856 Added verifier for trans 2023-01-08 14:29:17 -08:00
Phil Tillet
42421fabc5 . 2023-01-06 20:35:57 -08:00
Phil Tillet
600bcefb12 more optimizations 2023-01-06 20:27:49 -08:00
Philippe Tillet
18c7a72973 more pass template 2023-01-06 14:26:06 -08:00
Phil Tillet
b16aeb6541 added missing file 2023-01-06 13:39:54 -08:00
Phil Tillet
a81345f7c1 SinkConversionsFromShared template 2023-01-06 13:01:08 -08:00
Philippe Tillet
874ee11ab5 More optimizations 2023-01-06 11:04:20 -08:00
Philippe Tillet
e6f1a9ad34 commenting dq but not load/store 2023-01-05 23:25:41 -08:00
Philippe Tillet
6f997f4ecb dq now mma 2023-01-05 21:14:55 -08:00
Phil Tillet
520b69fe70 more reassociation 2023-01-05 16:05:11 -08:00
Phil Tillet
764134ee34 trying to decrease register pressure 2023-01-05 13:02:38 -08:00
Phil Tillet
1bde80b1e8 Added ptx code 2023-01-04 17:23:16 -08:00
Phil Tillet
268d2cd18d better convert + write-back 2023-01-04 17:12:35 -08:00
Phil Tillet
29a1e20b58 tweak convert + trans 2023-01-04 17:12:28 -08:00
Phil Tillet
36da342893 . 2023-01-04 11:25:03 -08:00
Phil Tillet
e70e1e76b4 swizzling 2023-01-04 11:21:19 -08:00
Phil Tillet
e3c3d9fc65 16 spills 2023-01-04 00:01:22 -08:00
Phil Tillet
ee86ea9c90 100 spills 2023-01-03 20:52:00 -08:00
Phil Tillet
645fa5c1cd . 2023-01-03 18:34:05 -08:00
Phil Tillet
8df1fa5e5b Merge remote-tracking branch 'origin/master' into phil/fused-attention-perf-fixup 2023-01-03 18:31:34 -08:00
Keren Zhou
8460ea3df1 [Frontend] Fix import for libdevice (#1028)
This is a hotfix for issue 1 in
https://github.com/openai/triton/issues/1017
2023-01-03 15:48:05 -08:00
Keren Zhou
678b9f53a2 [Backend] Use post-order traversal for liveness numbering (#1027)
Also add tests for `tt.trans`.
2023-01-03 15:11:54 -08:00
Phil Tillet
737e43a627 more tests 2023-01-03 09:48:08 -08:00
Phil Tillet
5c01c567b9 . 2023-01-02 23:13:12 -08:00
Phil Tillet
05920e0b8b reduced some spilling 2023-01-02 19:28:54 -08:00
Phil Tillet
c11fe351e1 . 2023-01-02 19:16:06 -08:00
Phil Tillet
b246d85fad trying to figure out spilling root cause 2022-12-30 15:21:00 -08:00
Phil Tillet
4dce8dd709 Merge remote-tracking branch 'origin/master' into phil/fused-attention-perf-fixup 2022-12-30 11:53:49 -08:00
goostavz
0e8590f1c9 [BACKEND] Add generic support of convert_layout from distributed to shared (#1025) 2022-12-30 11:29:58 -08:00
Phil Tillet
7388fb1de9 manual ttgir in bwd pass 2022-12-29 15:53:38 -08:00
fdrocha
194ba103b1 [BUILD] Fixed error when compiling in systems with multiple versions of python installed (#1019) 2022-12-29 15:10:34 -08:00
Phil Tillet
71e3143eaf . 2022-12-29 14:40:27 -08:00
goostavz
1d3029faf8 [Backend] Add value cache in emitting indices calculation and some refinement (#1018)
1, add explicit value cache in emitting indices calculation;
2, move the indices calculation emitting logics into
ConvertTritonGPUOpToLLVMPatternBase to avoid the redundant build cost by
templates. Refer to the discussion in this thread by @LyricZhao :
https://triton-lang.slack.com/archives/C042VBSQWNS/p1671336755922969
2022-12-29 11:19:59 -08:00
Phil Tillet
263ad883a6 . 2022-12-28 14:23:59 -08:00
Phil Tillet
54ae3e8d6e cleanup 2022-12-28 13:42:43 -08:00
Phil Tillet
7aba2a60d6 trying out another change 2022-12-27 21:51:51 -08:00
Phil Tillet
eefc9d1274 Added TTGIR kernel 2022-12-27 21:49:28 -08:00
Phil Tillet
0d6e6cf578 trying more things 2022-12-27 20:58:31 -08:00