Philippe Tillet
|
bb0f9235d1
|
[OPTIMIZER] Made layout simplification pass efficient for fused attention kernels (#790)
|
2022-10-21 16:52:15 -07:00 |
|
Philippe Tillet
|
623c99609f
|
[Triton-IR] Added type inference and verifier for Triton-IR operations (#767)
|
2022-10-11 18:16:41 -07:00 |
|
Shintaro Iwasaki
|
43be75ad42
|
[FRONTEND] Add scalar type support for some ops (#661)
This PR adds basic support for scalar-type inputs to some ops (cast and pointer arithmetics) for Triton-MLIR. Also renames getelementptr -> addptr
|
2022-09-15 16:12:52 -07:00 |
|
Da Yan
|
2e08450c80
|
[OPTIMIZER] Better pipeline tests (#660)
|
2022-09-14 23:26:40 -07:00 |
|
Philippe Tillet
|
9bd5a3dcd2
|
[OPTIMIZER] Pipeline async buffer (#110)
|
2022-09-09 11:01:14 -07:00 |
|
Da Yan
|
35e346bcff
|
[OPTIMIZER] Better pipeline pass (#100)
* Use `insert_slice_async` instead of `CopyAsync`
* Move async.wait to loop header
Co-authored-by: Jokeren <kerenzhou@openai.com>
|
2022-09-06 08:31:13 -07:00 |
|
Da Yan
|
92ef552a54
|
[OPTIMIZER] Fix Num in AsyncWaitOp generated by the pipeline pass (#72)
|
2022-08-22 15:58:10 -07:00 |
|
Shintaro Iwasaki
|
9aa00249a6
|
[TritonIR] make other optional and remove isOtherUnspecified (#67)
[Triton] make other optional and remove isOtherUnspecified
|
2022-08-18 18:19:55 -07:00 |
|
Da Yan
|
8776ad1a0e
|
[OPTIMIZER] Let the pipeline pass insert async wait. (#63)
|
2022-08-18 10:31:57 -07:00 |
|
Shintaro Iwasaki
|
d69ce77b19
|
[FRONTEND] add an attr for masked load without explicit other (#55)
|
2022-08-18 09:51:37 -07:00 |
|
Philippe Tillet
|
d1593e6ca8
|
[TritonGPU] Improved documentation and semantics of layout encodings (#30)
|
2022-07-31 13:59:44 -07:00 |
|
Yan Da
|
26fcc12afd
|
better unit tests
|
2022-06-07 19:35:38 +08:00 |
|
Yan Da
|
0e11435448
|
more tests
|
2022-06-06 21:10:28 +08:00 |
|