Commit Graph
-
9b2bc88d11
[BACKEND] Better bf16 support (#588)
daadaada
2022-07-20 12:22:37 +08:00 -
ec25d931b6
[GH-PAGES] Updated website
Philippe Tillet
2022-07-20 00:49:33 +00:00 -
a633d2b403
[Analysis] Added Axis Info Analysis (#8)
Philippe Tillet
2022-07-19 13:38:48 -07:00 -
9f8b4adf8e
[GH-PAGES] Updated website
Philippe Tillet
2022-07-19 00:52:52 +00:00 -
86cab58d89
[CI] Changed dev wheel date to UTC time to match CRON schedule (#587)
Philippe Tillet
2022-07-18 14:54:13 -07:00 -
30db1c142b
[GH-PAGES] Updated website
Philippe Tillet
2022-07-18 00:48:21 +00:00 -
d9354b9fbb
[GH-PAGES] Updated website
Philippe Tillet
2022-07-17 00:49:40 +00:00 -
dadcd858ad
[GH-PAGES] Updated website
Philippe Tillet
2022-07-16 00:49:55 +00:00 -
df940aaab0
Merge pull request #7 from openai/broadcastAxis-fix
Philippe Tillet
2022-07-15 08:39:49 -07:00 -
63e6a85901
Fix blocked layout parser
Yan Da
2022-07-15 15:19:11 +08:00 -
ca34df1084
[GH-PAGES] Updated website
Philippe Tillet
2022-07-15 00:51:15 +00:00 -
d1c6625bfd
[GH-PAGES] Updated website
Philippe Tillet
2022-07-14 07:22:19 +00:00 -
5b04331dd2
[TUTORIALS] Added more credits in fused attention tutorial
Phil Tillet
2022-07-13 23:48:58 -07:00 -
0a3f3d5f25
[PACKAGING] Include triton/language/libdevice.10.bc in package data (#582)
Jason Ansel
2022-07-13 23:45:27 -07:00 -
4912916c11
[FRONTEND] Added support for element-wise function defined in external LLVM bitcode (e.g., libdevice) (#562)
Keren Zhou
2022-07-13 15:52:21 -07:00 -
971f5782b4
[tutorials] Added flash attention credits in tutorial
Phil Tillet
2022-07-11 18:56:48 -07:00 -
d5eb9bc230
[tutorial] Added bwd in fused attention example (#579)
Philippe Tillet
2022-07-11 15:43:46 -07:00 -
c9a2b9c7d4
[FRONTEND] Add missing args to get_simd_tflops() (#578)
Jason Ansel
2022-07-11 14:37:59 -07:00 -
65237f6117
[PACKAGING] Added FileCheck
Phil Tillet
2022-07-07 16:53:19 -07:00 -
4a399a7e40
[BACKEND] Fix some bugs (atomics, a segfault...) (#577)
Philippe Tillet
2022-07-06 20:03:04 -07:00 -
22105bc33b
[FRONTEND] Added type check in semantic arange (#572)
vesuppi
2022-07-03 15:25:37 -07:00 -
4bf509889b
[BUILD] Change the default build type to Release (#571)
Keren Zhou
2022-07-01 12:17:22 -07:00 -
a74cce375f
[FRONTEND] Raise broadcast error (#555)
Keren Zhou
2022-06-30 17:32:07 -07:00 -
f733327ba4
[BACKEND][CODEGEN] Disabling L2 residency control by default (#570)
Philippe Tillet
2022-06-29 17:05:13 -07:00 -
1bbb2430d9
[TUTORIALS] adjust heuristics for dwdb kernel (#565)
Natalia Gimelshein
2022-06-29 17:00:22 -07:00 -
1895ceaa2d
[TUTORIAL] Fix f-string for older python (#569)
Kashif Rasul
2022-06-29 18:39:10 +02:00 -
feb7a2a0dc
[FRONTEND] Hotfix for `store` argument order (#567)
Philippe Tillet
2022-06-28 00:24:02 -07:00 -
5b4c8f221e
[BACKEND] Compiler improvements (#557)
Philippe Tillet
2022-06-27 11:49:19 -07:00 -
3e815114fd
[GH-PAGES] Updated website
Philippe Tillet
2022-06-27 00:48:22 +00:00 -
87413bc925
[BACKEND] Fix layout convert for non-contiguous input (#564)
Keren Zhou
2022-06-25 23:12:03 -07:00 -
09a0e3767a
[GH-PAGES] Updated website
Philippe Tillet
2022-06-26 00:50:12 +00:00 -
37abd97851
[GH-PAGES] Updated website
Philippe Tillet
2022-06-25 00:46:57 +00:00 -
08c4b2c3be
[GH-PAGES] Updated website
Philippe Tillet
2022-06-24 00:46:49 +00:00 -
d345ddf837
[DOCS] Separate atomic cas from other atomic operations since operands are very different (#559)
Keren Zhou
2022-06-22 17:51:17 -07:00 -
77f3a2cf96
[GH-PAGES] Updated website
Philippe Tillet
2022-06-23 00:46:26 +00:00 -
b02bac41ba
[CI] Change cache dir (#561)
Keren Zhou
2022-06-22 11:44:35 -07:00 -
6bf3700c9c
[GH-PAGES] Updated website
Philippe Tillet
2022-06-22 00:49:37 +00:00 -
a428cf0bb2
[FRONTEND] Fix pytorch warning. (#560)
Keren Zhou
2022-06-20 20:12:09 -07:00 -
c168f03e0c
[GH-PAGES] Updated website
Philippe Tillet
2022-06-21 00:46:27 +00:00 -
ab91a5bbc3
[GH-PAGES] Updated website
Philippe Tillet
2022-06-20 00:46:53 +00:00 -
1f4cea595d
[GH-PAGES] Updated website
Philippe Tillet
2022-06-19 00:46:49 +00:00 -
9d1b5e3f79
special encoding for broadcast
Yan Da
2022-06-18 21:16:45 +08:00 -
53cf93ce6a
Revert "Remove TypeConverter from TritonToTritonGPU conversion"
Yan Da
2022-06-18 14:57:41 +08:00 -
64d0b87ef0
Remove TypeConverter from TritonToTritonGPU conversion
Yan Da
2022-06-18 14:34:59 +08:00 -
5de1b15fff
[GH-PAGES] Updated website
Philippe Tillet
2022-06-18 00:47:57 +00:00 -
9feb256b71
op combine in Triton Dialect: broadcast(cst) -> cst
Yan Da
2022-06-17 16:19:47 +08:00 -
4412443b59
[GH-PAGES] Updated website
Philippe Tillet
2022-06-17 00:46:29 +00:00 -
2c4a040453
[GH-PAGES] Updated website
Philippe Tillet
2022-06-16 00:46:38 +00:00 -
b5e728cb14
Add argmin argmax (#552)
Keren Zhou
2022-06-15 13:55:20 -07:00 -
6b9756532f
[BACKEND] Remove print in coalesce.cc (#551)
Jason Ansel
2022-06-15 13:13:20 -07:00 -
8ce2c12e33
[PYTHON] move ephemeral files to homedir (#549)
Madeleine Thompson
2022-06-13 19:37:52 -07:00 -
4e12c1cfa5
[GH-PAGES] Updated website
Philippe Tillet
2022-06-14 00:49:31 +00:00 -
93209c07e0
[BACKEND][CODEGEN] Fix reduce uint (#547)
Keren Zhou
2022-06-13 16:43:57 -07:00 -
58c8889235
[FRONTEND] Fix scanline layout (#548)
Philippe Tillet
2022-06-13 16:21:10 -07:00 -
7094657aa9
[FRONTEND] fix bool conversion of floating types (#545)
Natalia Gimelshein
2022-06-13 15:52:37 -07:00 -
928064b729
[GH-PAGES] Updated website
Philippe Tillet
2022-06-13 00:48:38 +00:00 -
35736aa44e
more progress on the testing infrastructure
Yan Da
2022-06-12 15:14:45 +08:00 -
b9fb88f0a6
[GH-PAGES] Updated website
Philippe Tillet
2022-06-12 00:47:57 +00:00 -
410d612f77
[GH-PAGES] Updated website
Philippe Tillet
2022-06-11 00:48:41 +00:00 -
22c65a53d9
more progress on test/CMakeLists.txt
Yan Da
2022-06-10 21:37:56 +08:00 -
0ee6e486f8
add cse pass to the pipeline & pass num-warps as an argument
Yan Da
2022-06-10 17:31:48 +08:00 -
8168c311b3
[GH-PAGES] Updated website
Philippe Tillet
2022-06-10 00:47:50 +00:00 -
2e87f7645e
[GH-PAGES] Updated website
Philippe Tillet
2022-06-09 00:48:08 +00:00 -
117a402c1b
more comments to TypeConverter & update warpTileSize
Yan Da
2022-06-08 16:20:07 +08:00 -
49d1821149
conversion test
Yan Da
2022-06-08 16:19:15 +08:00 -
38573d1261
[FRONTEND] Return allocated registers and spilled registers for users (#541)
Keren Zhou
2022-06-07 18:37:12 -07:00 -
ec86d5284f
[GH-PAGES] Updated website
Philippe Tillet
2022-06-08 00:46:05 +00:00 -
26fcc12afd
better unit tests
Yan Da
2022-06-07 19:35:38 +08:00 -
7b09b5f9e9
the pipeline pass now generates and accepts valid IR
Yan Da
2022-06-07 19:34:59 +08:00 -
560e29229b
register conversion in triton-opt
Yan Da
2022-06-07 19:33:51 +08:00 -
563da1150b
[GH-PAGES] Updated website
Philippe Tillet
2022-06-07 00:45:33 +00:00 -
2cdc6d35c4
[FRONTEND] Give col_per_thread an initial value to make the compiler happy (#535)
Mengchi Zhang
2022-06-06 12:48:23 -07:00 -
f13cbaab9f
[FRONTEND] assert that num_warps is a power of 2 (#539)
TC
2022-06-06 14:37:08 -04:00 -
0e11435448
more tests
Yan Da
2022-06-06 21:10:28 +08:00 -
366dddc3bc
update mma encoding & triton-opt
Yan Da
2022-06-06 21:03:58 +08:00 -
9927d8c291
[GH-PAGES] Updated website
Philippe Tillet
2022-06-06 00:46:47 +00:00 -
fd3a9985ea
[GH-PAGES] Updated website
Philippe Tillet
2022-06-05 21:05:02 +00:00 -
751e325d2e
[TUTORIALS] Fixed typo
Philippe Tillet
2022-06-05 13:32:35 -07:00 -
a598db498f
[GH-PAGES] Updated website
Philippe Tillet
2022-06-05 20:22:31 +00:00 -
537d98825f
[GH-PAGES] Updated website
Philippe Tillet
2022-06-05 19:52:40 +00:00 -
801c8a4c92
[TUTORIALS] Fixed typo
Philippe Tillet
2022-06-05 12:32:07 -07:00 -
5803154ef2
[GH-PAGES] Updated website
Philippe Tillet
2022-06-05 18:58:41 +00:00 -
7807f64ef3
rename sharded_layout => blocked_layout
Yan Da
2022-06-05 16:14:59 +08:00 -
bbf75b492f
more tests
Yan Da
2022-06-05 15:10:09 +08:00 -
a4a2c72173
default address space of PointerType 0 => 1
Yan Da
2022-06-05 15:09:41 +08:00 -
d5eca56cf3
more TritonGPU unit tests
Yan Da
2022-06-05 14:25:09 +08:00 -
b02792c1b7
[GH-PAGES] Updated website
Philippe Tillet
2022-06-05 00:40:28 +00:00 -
55cf9a0a97
Add triton's opt
Yan Da
2022-06-04 22:10:00 +08:00 -
c691ca835d
[GH-PAGES] Updated website
Philippe Tillet
2022-06-04 00:51:13 +00:00 -
8876e53206
[BACKEND] Restored reduction bugfixes
Philippe Tillet
2022-06-03 11:38:52 -07:00 -
a60374a597
Revert "[BACKEND] Various bug fixes; making reductions faster (#533)".
Philippe Tillet
2022-06-03 11:36:06 -07:00 -
4d64e1b60e
[GH-PAGES] Updated website
Philippe Tillet
2022-06-03 00:42:29 +00:00 -
efa04cac1f
[FRONTEND] A couple of bugfixes (#534)
Philippe Tillet
2022-06-02 16:57:37 -07:00 -
830fe19d58
Merge branch 'mlir-rewrite' of https://github.com/daadaada/mlir-rewrite into mlir-rewrite
Yan Da
2022-06-01 10:59:20 +08:00 -
935390dc03
update examples
Yan Da
2022-06-01 10:59:16 +08:00 -
3e7500dfe6
[BACKEND] Various bug fixes; making reductions faster (#533)
Philippe Tillet
2022-05-31 17:14:44 -07:00 -
e36a54eb86
more progress on the definition of layouts
Da Yan
2022-05-31 11:43:21 +00:00 -
37037bb3be
[FRONTEND] Default cache dir to /tmp/triton_$USER (#527)
Bert Maher
2022-05-27 16:51:05 -04:00 -
c82a206684
[FRONTEND] Better dot error message (#531)
Philippe Tillet
2022-05-26 17:41:09 -07:00 -
41d338d848
Fix op mapping in pipeline.cpp
Yan Da
2022-05-26 13:57:01 +08:00