[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-06-10 00:47:50 +00:00
parent 2e87f7645e
commit 8168c311b3
158 changed files with 288 additions and 288 deletions

View File

@@ -459,37 +459,37 @@ We can now compare the performance of our kernel against that of cuBLAS. Here we
matmul-performance:
M cuBLAS ... Triton Triton (+ LeakyReLU)
0 256.0 2.730667 ... 2.978909 2.978909
1 384.0 7.372800 ... 7.899428 7.899428
0 256.0 2.730667 ... 2.978909 3.276800
1 384.0 7.372800 ... 8.507077 8.507077
2 512.0 14.563555 ... 15.420235 15.420235
3 640.0 22.260869 ... 24.380953 24.380953
4 768.0 32.768000 ... 34.028308 34.028308
4 768.0 32.768000 ... 35.389441 34.028308
5 896.0 37.971025 ... 41.321411 39.025776
6 1024.0 49.932191 ... 53.773130 52.428801
7 1152.0 45.242181 ... 48.161033 47.396572
8 1280.0 51.200001 ... 57.690139 57.690139
9 1408.0 64.138541 ... 68.147202 67.305878
10 1536.0 79.526831 ... 80.430545 78.643199
11 1664.0 63.372618 ... 63.372618 62.061463
12 1792.0 72.983276 ... 63.499573 62.790080
13 1920.0 68.776119 ... 71.257735 70.892307
10 1536.0 80.430545 ... 80.430545 78.643199
11 1664.0 62.929456 ... 63.372618 62.492442
12 1792.0 72.983276 ... 63.499573 63.142831
13 1920.0 69.120002 ... 71.257735 70.892307
14 2048.0 73.262953 ... 78.033565 76.959706
15 2176.0 83.155572 ... 87.115360 85.632545
16 2304.0 68.643310 ... 78.064941 76.809875
17 2432.0 71.125224 ... 75.522751 74.521127
18 2560.0 77.833728 ... 82.331658 81.310171
19 2688.0 83.737433 ... 90.966561 89.464755
20 2816.0 83.712490 ... 84.523664 83.392363
21 2944.0 82.373605 ... 84.324925 83.899046
22 3072.0 80.890151 ... 88.335577 79.526831
23 3200.0 83.989503 ... 96.530922 93.979441
24 3328.0 83.808259 ... 86.736504 86.113988
25 3456.0 80.380430 ... 92.138932 89.679166
26 3584.0 88.152348 ... 92.505546 95.451583
27 3712.0 82.423549 ... 87.706180 87.170458
28 3840.0 84.292684 ... 92.275341 86.943395
29 3968.0 91.816356 ... 87.035620 90.054568
30 4096.0 91.867031 ... 93.924229 87.896352
16 2304.0 68.251065 ... 78.064941 76.809875
17 2432.0 71.305746 ... 75.522751 74.521127
18 2560.0 77.833728 ... 82.125311 81.512437
19 2688.0 84.108772 ... 90.316801 90.102270
20 2816.0 79.154642 ... 84.197315 82.602666
21 2944.0 82.237674 ... 83.477440 83.060049
22 3072.0 82.301023 ... 90.020831 88.060814
23 3200.0 83.934428 ... 96.385543 94.534716
24 3328.0 83.226931 ... 86.424125 85.398926
25 3456.0 81.026701 ... 87.632137 90.687926
26 3584.0 87.296493 ... 99.905993 98.160909
27 3712.0 83.178475 ... 89.194055 85.675250
28 3840.0 83.781816 ... 92.236860 85.103501
29 3968.0 91.850912 ... 87.945181 90.321193
30 4096.0 87.495257 ... 90.412750 92.755862
[31 rows x 5 columns]
@@ -499,7 +499,7 @@ We can now compare the performance of our kernel against that of cuBLAS. Here we
.. rst-class:: sphx-glr-timing
**Total running time of the script:** ( 6 minutes 11.431 seconds)
**Total running time of the script:** ( 6 minutes 7.264 seconds)
.. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py: