[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-06-20 00:46:53 +00:00
parent 1f4cea595d
commit ab91a5bbc3
158 changed files with 234 additions and 234 deletions

View File

@@ -459,37 +459,37 @@ We can now compare the performance of our kernel against that of cuBLAS. Here we
matmul-performance:
M cuBLAS ... Triton Triton (+ LeakyReLU)
0 256.0 2.730667 ... 2.978909 3.276800
1 384.0 7.372800 ... 7.899428 7.899428
2 512.0 14.563555 ... 15.420235 15.420235
0 256.0 2.730667 ... 2.978909 2.978909
1 384.0 7.372800 ... 8.507077 8.507077
2 512.0 14.563555 ... 16.384000 15.420235
3 640.0 22.260869 ... 24.380953 24.380953
4 768.0 32.768000 ... 35.389441 34.028308
5 896.0 37.971025 ... 40.140799 39.025776
4 768.0 32.768000 ... 34.028308 34.028308
5 896.0 39.025776 ... 40.140799 39.025776
6 1024.0 49.932191 ... 53.773130 52.428801
7 1152.0 45.242181 ... 48.161033 47.396572
8 1280.0 51.200001 ... 57.690139 57.690139
9 1408.0 64.138541 ... 68.147202 66.485074
10 1536.0 80.430545 ... 81.355034 79.526831
11 1664.0 63.372618 ... 63.372618 62.492442
10 1536.0 79.526831 ... 80.430545 78.643199
11 1664.0 62.929456 ... 63.372618 62.492442
12 1792.0 72.983276 ... 73.460287 59.467852
13 1920.0 69.120002 ... 71.257735 70.892307
14 2048.0 73.262953 ... 78.033565 76.959706
15 2176.0 83.155572 ... 87.876193 85.998493
16 2304.0 68.251065 ... 78.064941 77.307030
17 2432.0 71.487187 ... 86.979769 85.915795
18 2560.0 78.019048 ... 82.747477 81.108913
19 2688.0 83.922689 ... 90.316801 88.836198
20 2816.0 82.135981 ... 85.017948 84.035084
21 2944.0 81.967162 ... 83.060049 81.832567
22 3072.0 81.121923 ... 89.593522 88.060814
23 3200.0 84.768213 ... 97.116842 95.380032
24 3328.0 83.613586 ... 85.602017 84.101981
25 3456.0 81.849303 ... 86.503829 83.893412
26 3584.0 86.457107 ... 98.699661 97.205829
27 3712.0 82.491612 ... 89.273764 84.444075
28 3840.0 85.070769 ... 87.217666 91.247522
29 3968.0 89.690508 ... 92.024087 85.004484
30 4096.0 94.320258 ... 90.200084 82.241256
16 2304.0 68.446623 ... 78.064941 77.057651
17 2432.0 71.305746 ... 86.711310 84.621881
18 2560.0 77.833728 ... 82.331658 81.715711
19 2688.0 83.369354 ... 90.966561 89.044730
20 2816.0 82.602666 ... 84.197315 83.552120
21 2944.0 81.698415 ... 83.477440 82.509987
22 3072.0 82.420822 ... 86.053349 88.612060
23 3200.0 84.712112 ... 89.387425 95.096582
24 3328.0 83.808259 ... 85.703924 84.397770
25 3456.0 82.519518 ... 91.928814 89.579522
26 3584.0 85.552231 ... 95.756542 95.654673
27 3712.0 86.044224 ... 89.353616 83.386762
28 3840.0 85.201850 ... 93.012618 85.597527
29 3968.0 91.747320 ... 85.330496 89.789505
30 4096.0 91.522488 ... 90.687655 90.382307
[31 rows x 5 columns]
@@ -499,7 +499,7 @@ We can now compare the performance of our kernel against that of cuBLAS. Here we
.. rst-class:: sphx-glr-timing
**Total running time of the script:** ( 6 minutes 6.038 seconds)
**Total running time of the script:** ( 6 minutes 6.898 seconds)
.. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py: