Commit Graph
Select branches
Hide Pull Requests
fix-extelemwise-in-combine-ops
gh-pages
jit-hook
keren/assert
keren/improve-hook
keren/insert-slice-other-nonzero
keren/perf-debug
keren/v100-perf-regression
master
phil/fused-attention-perf-fixup
phil/mma-v1-is-row-debug
phil/swizzle-bug-repro
port-fma
rocm
#10
#100
#1000
#1001
#1002
#1004
#1006
#1007
#1008
#101
#1010
#1012
#1013
#1013
#1014
#1018
#1019
#102
#1020
#1020
#1025
#1027
#1028
#1029
#103
#1030
#1033
#1034
#1036
#1037
#1038
#1039
#104
#1042
#1043
#1043
#105
#106
#107
#108
#109
#11
#11
#110
#111
#112
#114
#116
#118
#119
#120
#121
#123
#124
#125
#126
#127
#128
#129
#13
#131
#132
#133
#134
#135
#136
#137
#138
#139
#140
#141
#142
#143
#144
#145
#146
#147
#148
#149
#15
#151
#152
#158
#164
#165
#167
#168
#172
#173
#178
#179
#18
#180
#185
#186
#188
#19
#190
#192
#193
#195
#198
#199
#20
#200
#203
#204
#205
#207
#209
#212
#219
#22
#222
#224
#225
#228
#23
#23
#231
#24
#240
#249
#250
#251
#253
#255
#256
#257
#258
#259
#260
#261
#268
#271
#272
#273
#276
#279
#28
#280
#281
#282
#283
#285
#286
#287
#288
#291
#292
#293
#294
#295
#296
#297
#298
#299
#3
#300
#301
#302
#303
#304
#305
#306
#307
#308
#309
#311
#312
#313
#314
#317
#318
#320
#324
#326
#331
#336
#337
#338
#342
#344
#345
#346
#347
#348
#349
#35
#350
#351
#356
#357
#358
#361
#362
#367
#368
#372
#373
#374
#377
#379
#38
#380
#381
#382
#383
#386
#387
#388
#390
#391
#392
#393
#394
#395
#396
#397
#399
#40
#400
#401
#403
#406
#407
#408
#409
#41
#413
#414
#415
#417
#418
#420
#421
#422
#423
#424
#425
#426
#427
#428
#430
#431
#432
#436
#438
#439
#440
#442
#444
#445
#446
#447
#448
#449
#45
#450
#451
#453
#455
#456
#457
#458
#462
#463
#464
#467
#468
#469
#470
#471
#473
#474
#478
#481
#482
#483
#484
#485
#487
#488
#490
#491
#492
#493
#495
#499
#500
#501
#502
#503
#505
#507
#510
#513
#514
#515
#516
#519
#52
#520
#522
#523
#524
#526
#527
#528
#53
#531
#533
#534
#535
#538
#538
#539
#541
#545
#546
#547
#548
#549
#551
#552
#553
#555
#556
#557
#559
#560
#561
#562
#564
#565
#567
#569
#57
#570
#571
#572
#575
#575
#577
#578
#579
#58
#582
#587
#588
#59
#590
#595
#598
#60
#600
#601
#602
#604
#606
#607
#608
#61
#614
#614
#617
#62
#623
#63
#632
#636
#637
#644
#65
#650
#651
#652
#653
#654
#655
#657
#658
#66
#660
#661
#662
#663
#664
#665
#666
#667
#668
#669
#670
#671
#672
#678
#68
#682
#683
#684
#685
#689
#69
#691
#692
#693
#694
#696
#697
#699
#7
#70
#700
#701
#702
#703
#704
#706
#708
#709
#71
#710
#711
#712
#715
#716
#718
#722
#724
#726
#727
#728
#729
#73
#732
#733
#735
#736
#738
#739
#740
#742
#746
#747
#749
#75
#750
#751
#752
#753
#754
#755
#757
#758
#759
#764
#765
#766
#767
#769
#77
#774
#775
#776
#777
#78
#780
#782
#784
#785
#786
#788
#789
#790
#791
#792
#794
#796
#797
#798
#799
#80
#800
#801
#803
#804
#805
#809
#81
#812
#814
#815
#816
#817
#818
#819
#82
#820
#821
#822
#823
#825
#826
#827
#829
#83
#830
#831
#833
#834
#835
#836
#837
#838
#839
#840
#841
#842
#843
#844
#845
#847
#848
#849
#850
#851
#852
#853
#854
#856
#857
#858
#859
#86
#862
#863
#864
#867
#868
#869
#87
#872
#873
#874
#875
#876
#877
#878
#879
#88
#880
#881
#883
#885
#886
#887
#887
#888
#889
#89
#890
#890
#894
#896
#897
#898
#899
#90
#901
#902
#903
#904
#906
#907
#908
#909
#91
#910
#912
#913
#914
#915
#916
#917
#918
#92
#920
#921
#922
#923
#924
#925
#926
#927
#928
#929
#93
#930
#931
#933
#936
#937
#938
#939
#94
#941
#943
#944
#945
#946
#947
#947
#948
#95
#951
#952
#953
#956
#957
#958
#959
#96
#960
#961
#962
#963
#964
#966
#968
#969
#97
#970
#971
#972
#973
#975
#976
#977
#978
#979
#980
#982
#982
#983
#985
#987
#988
#990
#991
#993
#994
#995
#996
#997
#998
#999
isaac
legacy-backend
v0.1
v0.2.3
v0.4
v1.0
v1.1
v1.1.1
v1.1.2
Select branches
Hide Pull Requests
fix-extelemwise-in-combine-ops
gh-pages
jit-hook
keren/assert
keren/improve-hook
keren/insert-slice-other-nonzero
keren/perf-debug
keren/v100-perf-regression
master
phil/fused-attention-perf-fixup
phil/mma-v1-is-row-debug
phil/swizzle-bug-repro
port-fma
rocm
#10
#100
#1000
#1001
#1002
#1004
#1006
#1007
#1008
#101
#1010
#1012
#1013
#1013
#1014
#1018
#1019
#102
#1020
#1020
#1025
#1027
#1028
#1029
#103
#1030
#1033
#1034
#1036
#1037
#1038
#1039
#104
#1042
#1043
#1043
#105
#106
#107
#108
#109
#11
#11
#110
#111
#112
#114
#116
#118
#119
#120
#121
#123
#124
#125
#126
#127
#128
#129
#13
#131
#132
#133
#134
#135
#136
#137
#138
#139
#140
#141
#142
#143
#144
#145
#146
#147
#148
#149
#15
#151
#152
#158
#164
#165
#167
#168
#172
#173
#178
#179
#18
#180
#185
#186
#188
#19
#190
#192
#193
#195
#198
#199
#20
#200
#203
#204
#205
#207
#209
#212
#219
#22
#222
#224
#225
#228
#23
#23
#231
#24
#240
#249
#250
#251
#253
#255
#256
#257
#258
#259
#260
#261
#268
#271
#272
#273
#276
#279
#28
#280
#281
#282
#283
#285
#286
#287
#288
#291
#292
#293
#294
#295
#296
#297
#298
#299
#3
#300
#301
#302
#303
#304
#305
#306
#307
#308
#309
#311
#312
#313
#314
#317
#318
#320
#324
#326
#331
#336
#337
#338
#342
#344
#345
#346
#347
#348
#349
#35
#350
#351
#356
#357
#358
#361
#362
#367
#368
#372
#373
#374
#377
#379
#38
#380
#381
#382
#383
#386
#387
#388
#390
#391
#392
#393
#394
#395
#396
#397
#399
#40
#400
#401
#403
#406
#407
#408
#409
#41
#413
#414
#415
#417
#418
#420
#421
#422
#423
#424
#425
#426
#427
#428
#430
#431
#432
#436
#438
#439
#440
#442
#444
#445
#446
#447
#448
#449
#45
#450
#451
#453
#455
#456
#457
#458
#462
#463
#464
#467
#468
#469
#470
#471
#473
#474
#478
#481
#482
#483
#484
#485
#487
#488
#490
#491
#492
#493
#495
#499
#500
#501
#502
#503
#505
#507
#510
#513
#514
#515
#516
#519
#52
#520
#522
#523
#524
#526
#527
#528
#53
#531
#533
#534
#535
#538
#538
#539
#541
#545
#546
#547
#548
#549
#551
#552
#553
#555
#556
#557
#559
#560
#561
#562
#564
#565
#567
#569
#57
#570
#571
#572
#575
#575
#577
#578
#579
#58
#582
#587
#588
#59
#590
#595
#598
#60
#600
#601
#602
#604
#606
#607
#608
#61
#614
#614
#617
#62
#623
#63
#632
#636
#637
#644
#65
#650
#651
#652
#653
#654
#655
#657
#658
#66
#660
#661
#662
#663
#664
#665
#666
#667
#668
#669
#670
#671
#672
#678
#68
#682
#683
#684
#685
#689
#69
#691
#692
#693
#694
#696
#697
#699
#7
#70
#700
#701
#702
#703
#704
#706
#708
#709
#71
#710
#711
#712
#715
#716
#718
#722
#724
#726
#727
#728
#729
#73
#732
#733
#735
#736
#738
#739
#740
#742
#746
#747
#749
#75
#750
#751
#752
#753
#754
#755
#757
#758
#759
#764
#765
#766
#767
#769
#77
#774
#775
#776
#777
#78
#780
#782
#784
#785
#786
#788
#789
#790
#791
#792
#794
#796
#797
#798
#799
#80
#800
#801
#803
#804
#805
#809
#81
#812
#814
#815
#816
#817
#818
#819
#82
#820
#821
#822
#823
#825
#826
#827
#829
#83
#830
#831
#833
#834
#835
#836
#837
#838
#839
#840
#841
#842
#843
#844
#845
#847
#848
#849
#850
#851
#852
#853
#854
#856
#857
#858
#859
#86
#862
#863
#864
#867
#868
#869
#87
#872
#873
#874
#875
#876
#877
#878
#879
#88
#880
#881
#883
#885
#886
#887
#887
#888
#889
#89
#890
#890
#894
#896
#897
#898
#899
#90
#901
#902
#903
#904
#906
#907
#908
#909
#91
#910
#912
#913
#914
#915
#916
#917
#918
#92
#920
#921
#922
#923
#924
#925
#926
#927
#928
#929
#93
#930
#931
#933
#936
#937
#938
#939
#94
#941
#943
#944
#945
#946
#947
#947
#948
#95
#951
#952
#953
#956
#957
#958
#959
#96
#960
#961
#962
#963
#964
#966
#968
#969
#97
#970
#971
#972
#973
#975
#976
#977
#978
#979
#980
#982
#982
#983
#985
#987
#988
#990
#991
#993
#994
#995
#996
#997
#998
#999
isaac
legacy-backend
v0.1
v0.2.3
v0.4
v1.0
v1.1
v1.1.1
v1.1.2
-
540e088822
[GH-PAGES] Updated website
Philippe Tillet
2022-09-03 00:48:40 +00:00 -
d0b4c67b05
[OPTIMIZER] Improved layout conversion simplification algorithm (#97)
Philippe Tillet
2022-09-02 16:52:44 -07:00 -
6f3b262d6c
[GH-PAGES] Updated website
Philippe Tillet
2022-09-02 00:49:33 +00:00 -
3c635449e5
[Triton] Support math and libdevice ops (#91)
Shintaro Iwasaki
2022-09-01 16:34:27 -07:00 -
328b87aec6
Keren/tensor slice insert alloc (#94)
Keren Zhou
2022-09-01 12:37:17 -07:00 -
d01353de07
[CI] add assert-enabled MLIR option (#78)
Shintaro Iwasaki
2022-08-31 18:55:32 -07:00 -
41ab369e01
[GH-PAGES] Updated website
Philippe Tillet
2022-09-01 00:49:59 +00:00 -
cb05587449
[GH-PAGES] Updated website
Philippe Tillet
2022-08-31 00:55:11 +00:00 -
437ced38c2
fp8 <> bf16 conversion (#637)
Da Yan
2022-08-31 05:20:12 +08:00 -
bbcfff8e7c
[GH-PAGES] Updated website
Philippe Tillet
2022-08-30 00:54:38 +00:00 -
02ebf24d35
Analyze shared memory alias (#81)
Keren Zhou
2022-08-29 10:43:20 -07:00 -
0eaa2d3583
[GH-PAGES] Updated website
Philippe Tillet
2022-08-29 00:51:48 +00:00 -
287ed5ceeb
[GH-PAGES] Updated website
Philippe Tillet
2022-08-28 00:54:08 +00:00 -
210a296699
[BACKEND] bf16 flash-attention (#636)
Da Yan
2022-08-27 11:40:55 +08:00 -
79bb9e69b7
[GH-PAGES] Updated website
Philippe Tillet
2022-08-27 00:48:56 +00:00 -
fe0c29b9ec
Fix inconsistent struct declaration instead of class. (#632)
Daniil Fukalov
2022-08-27 02:20:21 +03:00 -
83287d7193
[CI] enable self-hosted runner (#85)
Philippe Tillet
2022-08-25 19:12:16 -07:00 -
65cfbbabe5
[GH-PAGES] Updated website
Philippe Tillet
2022-08-26 00:49:56 +00:00 -
a1fd5d2238
[GH-PAGES] Updated website
Philippe Tillet
2022-08-25 00:51:52 +00:00 -
bedbf221c0
[BACKEND] Support optional mask in TritonGPUToLLVM (#80)
goostavz
2022-08-25 08:51:37 +08:00 -
84aa7d025a
[TritonIR] simplify Load/StoreOps when mask is true/false (#79)
Shintaro Iwasaki
2022-08-24 12:55:49 -07:00 -
aaf54f10e5
[GH-PAGES] Updated website
Philippe Tillet
2022-08-24 00:50:18 +00:00 -
1b513c9866
[BACKEND] Refactoring codegen for LoadOp with PTXFormat (#77)
Yan Chunwei
2022-08-24 06:51:13 +08:00 -
0ebef11c77
[TritonIR] Make mask operand optional (#74)
Shintaro Iwasaki
2022-08-22 22:00:17 -07:00 -
de2dd04c8a
[BACKEND] two minor bugfix on StoreOpLowering and kernel launch & support optional other in LoadOpLowering (#69)
goostavz
2022-08-23 12:47:09 +08:00 -
42588a2a5c
[GH-PAGES] Updated website
Philippe Tillet
2022-08-23 00:54:28 +00:00 -
92ef552a54
[OPTIMIZER] Fix Num in AsyncWaitOp generated by the pipeline pass (#72)
Da Yan
2022-08-23 06:58:10 +08:00 -
9a6ec45a5b
[GH-PAGES] Updated website
Philippe Tillet
2022-08-22 00:50:12 +00:00 -
76917619fc
[GH-PAGES] Updated website
Philippe Tillet
2022-08-21 00:48:50 +00:00 -
b4367e5d30
[GH-PAGES] Updated website
Philippe Tillet
2022-08-20 00:47:58 +00:00 -
10ba51c3bb
[FRONTEND] add python e2e launch empty kernel test (#68)
Yan Chunwei
2022-08-20 01:46:01 +08:00 -
9aa00249a6
[TritonIR] make other optional and remove isOtherUnspecified (#67)
Shintaro Iwasaki
2022-08-18 18:19:55 -07:00 -
a7462d444b
[GH-PAGES] Updated website
Philippe Tillet
2022-08-19 00:50:31 +00:00 -
192be76b3c
[OPTIMIZER] Rewrite patterns for layout conversions (#64)
Philippe Tillet
2022-08-18 12:49:37 -07:00 -
e0bedeb44c
[BACKEND] Keren/shared memory barrier (#59)
Keren Zhou
2022-08-18 12:32:57 -07:00 -
8776ad1a0e
[OPTIMIZER] Let the pipeline pass insert async wait. (#63)
Da Yan
2022-08-19 01:31:57 +08:00 -
d69ce77b19
[FRONTEND] add an attr for masked load without explicit other (#55)
Shintaro Iwasaki
2022-08-18 09:51:37 -07:00 -
fc58250a06
[BACKEND] Add backend support of arith::AddIOp, arith::AddFOp, GetProgramIdOp & GEPOp and bugfix for SplatOp, StoreOp, FuncOp (#60)
goostavz
2022-08-18 20:46:45 +08:00 -
db7b163cb5
[GH-PAGES] Updated website
Philippe Tillet
2022-08-18 00:48:52 +00:00 -
b1673caaf6
[FRONTEND] Expose end-to-end compile to python frontend (#58)
Yan Chunwei
2022-08-18 01:42:48 +08:00 -
f20cbb2743
[GH-PAGES] Updated website
Philippe Tillet
2022-08-17 00:49:36 +00:00 -
7394d732ad
[DOCS] support for variable head dimensions in flash attention triton tutorial (#623)
Phil Wang
2022-08-15 19:16:49 -07:00 -
d1343b5511
[GH-PAGES] Updated website
Philippe Tillet
2022-08-16 01:02:32 +00:00 -
95bbac41e7
[BACKEND] Add LLVM-translation for store and splat ops (#47)
Yan Chunwei
2022-08-15 15:46:37 +08:00 -
993ba7035a
[BACKEND] Codegen bringup, index calculation of blocked_layout & support of LoadOp, BroadcastOp, ViewOp & MakeRangeOp (#38)
goostavz
2022-08-15 10:58:59 +08:00 -
943e27aa53
[GH-PAGES] Updated website
Philippe Tillet
2022-08-15 00:49:17 +00:00 -
36804ec20e
[GH-PAGES] Updated website
Philippe Tillet
2022-08-14 00:49:28 +00:00 -
e5ec8e16ea
[BUILD] Fix setup.py (#45)
Da Yan
2022-08-13 16:38:31 -07:00 -
fa97b57bec
[GH-PAGES] Updated website
Philippe Tillet
2022-08-13 00:50:31 +00:00 -
d5856435d7
[CI] explicitly run unit tests (#54)
Shintaro Iwasaki
2022-08-12 13:39:04 -07:00 -
2ba9a83465
[BUILD] fix minor issues with MLIR assert enabled (#46)
Shintaro Iwasaki
2022-08-11 21:20:47 -07:00 -
c32c097ece
[GH-PAGES] Updated website
Philippe Tillet
2022-08-12 00:47:45 +00:00 -
3a48ca0d4d
[BUILD] Fix includes (#49)
Philippe Tillet
2022-08-11 11:49:29 -07:00 -
ca12a57c3b
[GH-PAGES] Updated website
Philippe Tillet
2022-08-11 00:50:19 +00:00 -
3e2953f357
Allow multiple_of and max_contiguous to accept n-d values (#617)
Da Yan
2022-08-10 09:59:32 -07:00 -
4b51054036
[GH-PAGES] Updated website
Philippe Tillet
2022-08-10 00:48:34 +00:00 -
24ae9b82dd
[GH-PAGES] Updated website
Philippe Tillet
2022-08-09 00:51:04 +00:00 -
83ef74f248
[BACKEND] Extracting numWarps from tritonGPU module (#39)
Yan Chunwei
2022-08-09 00:40:20 +08:00 -
920723cf3d
[BACKEND] add triton-translate to translate mlir to llvmir or PTX code (#37)
Yan Chunwei
2022-08-08 13:34:36 +08:00 -
d155d9a166
[GH-PAGES] Updated website
Philippe Tillet
2022-08-08 00:48:35 +00:00 -
cc79376222
Fix deprectaion warning on CreateGEP(Value *, ArrayRef<Value *>, const Twine &) (#608)
Daniil Fukalov
2022-08-08 03:10:18 +03:00 -
490d34e0d5
[FRONTEND] Fixed python bindings link options (#40)
Philippe Tillet
2022-08-07 13:09:12 -07:00 -
355b06f4b3
[GH-PAGES] Updated website
Philippe Tillet
2022-08-07 00:51:30 +00:00 -
73ee4b1d0d
[GH-PAGES] Updated website
Philippe Tillet
2022-08-06 00:49:10 +00:00 -
78ebbe24c7
[FRONTEND] Added
ExpandDimsOp
primitive (#36)Philippe Tillet
2022-08-04 18:41:06 -07:00 -
844e79e14c
[GH-PAGES] Updated website
Philippe Tillet
2022-08-05 00:53:59 +00:00 -
a7b49b3227
[BACKEND] Memory allocation (#33)
Keren Zhou
2022-08-04 11:22:49 -07:00 -
b988bae813
Init TritonGPU to LLVM dialect conversion (#32)
Yan Chunwei
2022-08-04 10:15:45 +08:00 -
b254e2b165
[GH-PAGES] Updated website
Philippe Tillet
2022-08-04 00:49:04 +00:00 -
d84d42ef8d
[GH-PAGES] Updated website
Philippe Tillet
2022-08-03 00:52:19 +00:00 -
7b91c7befd
Fix "warning: control reaches end of non-void function". (#607)
Daniil Fukalov
2022-08-03 02:12:48 +03:00 -
de0c86c743
[GH-PAGES] Updated website
Philippe Tillet
2022-08-02 00:50:10 +00:00 -
3236642e8f
[OPTIMIZER] Added memory coalescing pass (#31)
Philippe Tillet
2022-07-31 20:59:31 -07:00 -
54ca5217e0
[GH-PAGES] Updated website
Philippe Tillet
2022-08-01 00:51:04 +00:00 -
d1593e6ca8
[TritonGPU] Improved documentation and semantics of layout encodings (#30)
Philippe Tillet
2022-07-31 13:59:44 -07:00 -
57b5951fdb
[GH-PAGES] Updated website
Philippe Tillet
2022-07-31 00:50:01 +00:00 -
968f59027e
Expose
module.print
in pybind (#604)Sharad Vikram
2022-07-29 21:36:08 -07:00 -
0c21947b50
[GH-PAGES] Updated website
Philippe Tillet
2022-07-30 00:47:48 +00:00 -
d2acaf2b1e
[GH-PAGES] Updated website
Philippe Tillet
2022-07-29 00:48:17 +00:00 -
6cffd02787
[GH-PAGES] Updated website
Philippe Tillet
2022-07-28 00:52:17 +00:00 -
e02c82c765
[TritonIR] Convert Triton dialect's
Combine
pass to MLIR DRR based (#16)Yan Chunwei
2022-07-28 03:50:08 +08:00 -
432c3df265
[BUILD] MacOS can now build compiler and run MLIR tests (#25)
Philippe Tillet
2022-07-27 01:32:10 -07:00 -
2663713aec
[GH-PAGES] Updated website
Philippe Tillet
2022-07-27 00:51:15 +00:00 -
6d62d88d4f
[CI] run clang-format (#24)
Philippe Tillet
2022-07-26 17:25:03 -07:00 -
25357083e6
[CI] Added basic CI skeletons (#23)
Philippe Tillet
2022-07-26 14:16:30 -07:00 -
3265e0df5a
[PYTHON] Cleaned up legacy code; added simple standalone compilation API (#22)
Philippe Tillet
2022-07-26 11:06:45 -07:00 -
96cc6fb563
[TritonGPU] Pretty printer for layouts (#21)
Keren Zhou
2022-07-26 10:50:11 -07:00 -
bec0049ff5
[GH-PAGES] Updated website
Philippe Tillet
2022-07-26 00:50:19 +00:00 -
923d468187
Update LICENSE (#602)
Anton Kostin
2022-07-25 19:30:03 +03:00 -
27c9f3d8cb
[FRONTEND] Added comment on TensorSizeTrait::maxElement (#20)
Philippe Tillet
2022-07-25 01:18:45 -07:00 -
7eda373a12
Add lit dependency (#9)
Keren Zhou
2022-07-24 19:14:52 -07:00 -
84440be392
[GH-PAGES] Updated website
Philippe Tillet
2022-07-25 00:49:58 +00:00 -
027321cdcf
[FRONTEND] Make tl.rand() 1-exclusive (#601)
Jason Ansel
2022-07-24 17:47:23 -07:00 -
e02e56dc63
[FRONTEND] Add missing rfloordiv (#598)
Jason Ansel
2022-07-23 21:54:12 -07:00 -
ab56d310dd
[BACKEND][IR] Fixed up internal dtype size for booleans (1bit -> 8bit) (#600)
Philippe Tillet
2022-07-23 20:08:03 -07:00 -
379acd8521
[GH-PAGES] Updated website
Philippe Tillet
2022-07-24 00:52:21 +00:00 -
5f9c7bc693
[GH-PAGES] Updated website
Philippe Tillet
2022-07-23 00:49:35 +00:00 -
f28caddbf8
[FRONTEND] Allow tl.where to select pointers (#595)
Da Yan
2022-07-22 00:54:27 +08:00 -
10f1d77697
[GH-PAGES] Updated website
Philippe Tillet
2022-07-21 00:49:23 +00:00 -
af85f5fa46
[FRONTEND] Refresh cache when the source code of outlined functions are changed (#590)
Keren Zhou
2022-07-20 17:34:07 -07:00