[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-06-16 00:46:38 +00:00
parent 4e12c1cfa5
commit 2c4a040453
156 changed files with 252 additions and 252 deletions

View File

@@ -459,37 +459,37 @@ We can now compare the performance of our kernel against that of cuBLAS. Here we
matmul-performance:
M cuBLAS ... Triton Triton (+ LeakyReLU)
0 256.0 2.730667 ... 2.978909 2.978909
1 384.0 7.372800 ... 7.899428 7.899428
2 512.0 14.563555 ... 15.420235 15.420235
0 256.0 2.730667 ... 3.276800 3.276800
1 384.0 7.372800 ... 7.899428 8.507077
2 512.0 14.563555 ... 15.420235 16.384000
3 640.0 22.260869 ... 24.380953 24.380953
4 768.0 32.768000 ... 34.028308 34.028308
5 896.0 39.025776 ... 40.140799 39.025776
5 896.0 37.971025 ... 40.140799 39.025776
6 1024.0 49.932191 ... 53.773130 52.428801
7 1152.0 45.242181 ... 48.161033 47.396572
8 1280.0 51.200001 ... 57.690139 57.690139
9 1408.0 64.138541 ... 68.147202 66.485074
10 1536.0 79.526831 ... 80.430545 79.526831
11 1664.0 62.929456 ... 63.372618 62.492442
10 1536.0 80.430545 ... 81.355034 78.643199
11 1664.0 63.372618 ... 63.372618 62.492442
12 1792.0 72.983276 ... 73.460287 59.467852
13 1920.0 68.776119 ... 71.257735 70.892307
14 2048.0 73.262953 ... 78.033565 76.959706
15 2176.0 83.155572 ... 87.494120 85.998493
16 2304.0 68.446623 ... 78.064941 77.057651
17 2432.0 71.125224 ... 86.711310 75.118889
18 2560.0 77.833728 ... 82.956960 81.715711
19 2688.0 83.737433 ... 90.748936 89.254248
20 2816.0 80.916902 ... 85.017948 83.074685
21 2944.0 80.510553 ... 83.060049 82.509987
22 3072.0 80.316458 ... 89.735509 89.310890
23 3200.0 83.769634 ... 95.952022 95.522391
24 3328.0 83.226931 ... 85.398926 84.795401
25 3456.0 82.519518 ... 92.455926 87.632137
26 3584.0 87.381330 ... 98.483450 97.840469
27 3712.0 84.159518 ... 88.797643 87.437503
28 3840.0 80.901241 ... 92.006659 89.187096
29 3968.0 88.040360 ... 86.144680 89.921841
30 4096.0 91.025923 ... 93.760204 87.438257
15 2176.0 83.155572 ... 87.494120 85.632545
16 2304.0 68.251065 ... 78.064941 77.307030
17 2432.0 71.487187 ... 86.711310 74.918570
18 2560.0 77.833728 ... 82.331658 81.512437
19 2688.0 84.015627 ... 89.888756 89.464755
20 2816.0 84.035084 ... 84.852542 83.233226
21 2944.0 82.237674 ... 83.899046 82.509987
22 3072.0 82.062468 ... 89.735509 88.473602
23 3200.0 83.224970 ... 95.952022 95.665176
24 3328.0 82.891535 ... 85.703924 84.995628
25 3456.0 82.604067 ... 92.191613 86.876687
26 3584.0 87.296493 ... 98.375705 97.840469
27 3712.0 85.091436 ... 89.194055 86.192706
28 3840.0 85.070769 ... 93.484358 88.191387
29 3968.0 91.816356 ... 88.167587 91.472214
30 4096.0 88.475759 ... 89.777746 88.359266
[31 rows x 5 columns]
@@ -499,7 +499,7 @@ We can now compare the performance of our kernel against that of cuBLAS. Here we
.. rst-class:: sphx-glr-timing
**Total running time of the script:** ( 6 minutes 4.630 seconds)
**Total running time of the script:** ( 6 minutes 6.317 seconds)
.. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py: