goostavz
de2dd04c8a
[BACKEND] two minor bugfix on StoreOpLowering and kernel launch & support optional other in LoadOpLowering ( #69 )
...
* [BACKEND] two minor bugfix on StoreOpLowering and kernel launch & support optional other in LoadOpLowering
* Clean code
Co-authored-by: goostavz <gzhu@nvidia.com >
Co-authored-by: Yan Chunwei <yanchunwei@outlook.com >
2022-08-22 21:47:09 -07:00
Da Yan
92ef552a54
[OPTIMIZER] Fix Num in AsyncWaitOp generated by the pipeline pass ( #72 )
2022-08-22 15:58:10 -07:00
Yan Chunwei
10ba51c3bb
[FRONTEND] add python e2e launch empty kernel test ( #68 )
2022-08-19 10:46:01 -07:00
Shintaro Iwasaki
9aa00249a6
[TritonIR] make other optional and remove isOtherUnspecified ( #67 )
...
[Triton] make other optional and remove isOtherUnspecified
2022-08-18 18:19:55 -07:00
Philippe Tillet
192be76b3c
[OPTIMIZER] Rewrite patterns for layout conversions ( #64 )
2022-08-18 12:49:37 -07:00
Keren Zhou
e0bedeb44c
[BACKEND] Keren/shared memory barrier ( #59 )
2022-08-18 12:32:57 -07:00
Da Yan
8776ad1a0e
[OPTIMIZER] Let the pipeline pass insert async wait. ( #63 )
2022-08-18 10:31:57 -07:00
Shintaro Iwasaki
d69ce77b19
[FRONTEND] add an attr for masked load without explicit other ( #55 )
2022-08-18 09:51:37 -07:00
goostavz
fc58250a06
[BACKEND] Add backend support of arith::AddIOp, arith::AddFOp, GetProgramIdOp & GEPOp and bugfix for SplatOp, StoreOp, FuncOp ( #60 )
...
Add backend support of arith::AddIOp, arith::AddFOp, GetProgramIdOp, GEPOp and bugfix for SplatOp, StoreOp, FuncOp
Co-authored-by: gzhu <gzhu@nvidia.com >
2022-08-18 20:46:45 +08:00
Yan Chunwei
b1673caaf6
[FRONTEND] Expose end-to-end compile to python frontend ( #58 )
2022-08-17 10:42:48 -07:00
Yan Chunwei
95bbac41e7
[BACKEND] Add LLVM-translation for store and splat ops ( #47 )
2022-08-15 00:46:37 -07:00
goostavz
993ba7035a
[BACKEND] Codegen bringup, index calculation of blocked_layout & support of LoadOp, BroadcastOp, ViewOp & MakeRangeOp ( #38 )
...
Co-authored-by: gzhu <gzhu@nvidia.com >
2022-08-14 19:58:59 -07:00
Shintaro Iwasaki
2ba9a83465
[BUILD] fix minor issues with MLIR assert enabled ( #46 )
2022-08-11 21:20:47 -07:00
Yan Chunwei
83ef74f248
[BACKEND] Extracting numWarps from tritonGPU module ( #39 )
2022-08-08 09:40:20 -07:00
Yan Chunwei
920723cf3d
[BACKEND] add triton-translate to translate mlir to llvmir or PTX code ( #37 )
2022-08-07 22:34:36 -07:00
Philippe Tillet
78ebbe24c7
[FRONTEND] Added ExpandDimsOp
primitive ( #36 )
2022-08-04 18:41:06 -07:00
Keren Zhou
a7b49b3227
[BACKEND] Memory allocation ( #33 )
2022-08-04 11:22:49 -07:00
Yan Chunwei
b988bae813
Init TritonGPU to LLVM dialect conversion ( #32 )
...
* add toLLVM pass
* update num-warps setting in mlir
2022-08-04 10:15:45 +08:00
Philippe Tillet
3236642e8f
[OPTIMIZER] Added memory coalescing pass ( #31 )
2022-07-31 20:59:31 -07:00
Philippe Tillet
d1593e6ca8
[TritonGPU] Improved documentation and semantics of layout encodings ( #30 )
2022-07-31 13:59:44 -07:00
Yan Chunwei
e02c82c765
[TritonIR] Convert Triton dialect's Combine
pass to MLIR DRR based ( #16 )
2022-07-27 12:50:08 -07:00
Philippe Tillet
432c3df265
[BUILD] MacOS can now build compiler and run MLIR tests ( #25 )
2022-07-27 01:32:10 -07:00
Philippe Tillet
6d62d88d4f
[CI] run clang-format ( #24 )
2022-07-26 17:25:03 -07:00
Philippe Tillet
25357083e6
[CI] Added basic CI skeletons ( #23 )
...
Includes minor fixes to make things compile and pass static checks properly
2022-07-26 14:16:30 -07:00
Philippe Tillet
3265e0df5a
[PYTHON] Cleaned up legacy code; added simple standalone compilation API ( #22 )
2022-07-26 11:06:45 -07:00
Keren Zhou
96cc6fb563
[TritonGPU] Pretty printer for layouts ( #21 )
2022-07-26 10:50:11 -07:00
Philippe Tillet
a633d2b403
[Analysis] Added Axis Info Analysis ( #8 )
2022-07-19 13:38:48 -07:00
Yan Da
63e6a85901
Fix blocked layout parser
2022-07-15 15:19:11 +08:00
Yan Da
9d1b5e3f79
special encoding for broadcast
2022-06-18 21:16:45 +08:00
Yan Da
53cf93ce6a
Revert "Remove TypeConverter from TritonToTritonGPU conversion"
...
This reverts commit 64d0b87ef0
.
2022-06-18 14:57:41 +08:00
Yan Da
64d0b87ef0
Remove TypeConverter from TritonToTritonGPU conversion
2022-06-18 14:34:59 +08:00
Yan Da
9feb256b71
op combine in Triton Dialect: broadcast(cst) -> cst
2022-06-17 16:19:47 +08:00
Yan Da
117a402c1b
more comments to TypeConverter & update warpTileSize
2022-06-08 16:20:07 +08:00
Yan Da
7b09b5f9e9
the pipeline pass now generates and accepts valid IR
2022-06-07 19:34:59 +08:00
Yan Da
366dddc3bc
update mma encoding & triton-opt
2022-06-06 21:03:58 +08:00
Yan Da
7807f64ef3
rename sharded_layout => blocked_layout
2022-06-05 16:14:59 +08:00
Yan Da
a4a2c72173
default address space of PointerType 0 => 1
2022-06-05 15:09:41 +08:00
Yan Da
d5eca56cf3
more TritonGPU unit tests
2022-06-05 14:25:09 +08:00
Da Yan
e36a54eb86
more progress on the definition of layouts
2022-05-31 11:43:21 +00:00
Yan Da
41d338d848
Fix op mapping in pipeline.cpp
2022-05-26 13:57:01 +08:00
Yan Da
c529b462f5
more fixes on pipeline.cpp
2022-05-26 13:14:41 +08:00
Yan Da
71d1c10e19
Remove weird includes
2022-05-25 21:54:06 +08:00
Yan Da
9308e9c90c
A more general pipeliner
2022-05-25 21:52:51 +08:00
Yan Da
441fd7c3cc
assembly format
2022-05-25 17:53:24 +08:00
Yan Da
e6f89a5777
Fix ReduceOp conversion
2022-05-25 16:03:06 +08:00
Yan Da
9b670cfb9f
Add ReduceOp
2022-05-25 14:15:36 +08:00
Yan Da
a2c9f919a8
TritonGPU verifier
2022-05-24 19:48:56 +08:00
Yan Da
36c45ec687
make numStages an option in PipelinePass
2022-05-23 12:47:55 +08:00
Yan Da
79298d61bc
fix a pipeline issue
2022-05-16 19:38:40 +08:00
Yan Da
c3c4ac3733
TritonGPU combiner
2022-05-16 19:17:15 +08:00