[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-04-14 00:44:57 +00:00
parent 9d65bf62fb
commit 2a53636c23
158 changed files with 346 additions and 346 deletions

View File

@@ -563,43 +563,43 @@ torch_output=tensor([[ 1.1045, -36.9688, 31.4688, ..., -11.3906, 24.4531, -3
<img alt="03 matrix multiplication" class="sphx-glr-single-img" src="../../_images/sphx_glr_03-matrix-multiplication_001.png" />
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>matmul-performance:
M cuBLAS ... Triton Triton (+ LeakyReLU)
0 256.0 2.730667 ... 3.276800 2.978909
1 384.0 7.372800 ... 7.899428 8.507077
2 512.0 14.563555 ... 15.420235 16.384000
3 640.0 22.260869 ... 24.380953 24.380953
4 768.0 32.768000 ... 35.389441 34.028308
5 896.0 37.971025 ... 40.140799 40.140799
6 1024.0 49.932191 ... 53.773130 53.773130
7 1152.0 44.566925 ... 48.161033 47.396572
8 1280.0 51.200001 ... 57.690139 57.690139
9 1408.0 64.138541 ... 69.009825 67.305878
10 1536.0 79.526831 ... 79.526831 79.526831
11 1664.0 62.929456 ... 63.372618 62.929456
12 1792.0 72.983276 ... 63.142831 63.142831
13 1920.0 68.776119 ... 71.257735 70.892307
14 2048.0 73.262953 ... 78.398206 77.672296
15 2176.0 83.155572 ... 87.494120 86.367588
16 2304.0 68.251065 ... 77.810656 77.558029
17 2432.0 71.125224 ... 75.726318 75.320281
18 2560.0 77.833728 ... 82.331658 82.125311
19 2688.0 83.737433 ... 90.966561 89.888756
20 2816.0 80.767055 ... 83.712490 83.712490
21 2944.0 81.765436 ... 83.477440 83.899046
22 3072.0 81.943708 ... 89.030036 88.820552
23 3200.0 82.901554 ... 92.753621 88.213121
24 3328.0 80.798314 ... 82.369902 86.320498
25 3456.0 80.945348 ... 88.790274 91.200871
26 3584.0 87.127323 ... 100.017124 99.463928
27 3712.0 81.615477 ... 86.641231 88.248537
28 3840.0 82.778440 ... 87.148936 91.322872
29 3968.0 86.051653 ... 85.391135 84.270676
30 4096.0 91.366730 ... 84.149044 90.748973
M cuBLAS ... Triton Triton (+ LeakyReLU)
0 256.0 2.730667 ... 2.978909 2.978909
1 384.0 7.372800 ... 8.507077 8.507077
2 512.0 14.563555 ... 15.420235 15.420235
3 640.0 22.260869 ... 24.380953 24.380953
4 768.0 31.597714 ... 34.028308 34.028308
5 896.0 37.971025 ... 40.140799 39.025776
6 1024.0 49.932191 ... 52.428801 52.428801
7 1152.0 43.911529 ... 46.656000 46.656000
8 1280.0 49.951220 ... 56.888887 56.109587
9 1408.0 62.664092 ... 67.305878 67.305878
10 1536.0 78.643199 ... 78.643199 78.643199
11 1664.0 62.061463 ... 62.492442 62.061463
12 1792.0 71.588687 ... 62.441243 62.096267
13 1920.0 68.098521 ... 70.172588 69.467336
14 2048.0 72.005219 ... 76.608294 76.260072
15 2176.0 81.472263 ... 85.269692 84.909907
16 2304.0 67.289781 ... 76.319081 76.076024
17 2432.0 69.713308 ... 73.932798 73.738752
18 2560.0 76.204654 ... 80.709358 80.313727
19 2688.0 82.284288 ... 89.044730 89.464755
20 2816.0 80.916902 ... 82.916747 82.602666
21 2944.0 80.902653 ... 82.373605 82.373605
22 3072.0 81.005868 ... 88.060814 88.197981
23 3200.0 83.660130 ... 95.522391 94.814812
24 3328.0 82.369902 ... 84.695641 84.795401
25 3456.0 80.783132 ... 91.046379 91.200871
26 3584.0 86.291162 ... 98.591437 98.591437
27 3712.0 84.874549 ... 88.876645 88.876645
28 3840.0 83.845335 ... 92.083268 92.083268
29 3968.0 92.442373 ... 91.678389 91.472214
30 4096.0 93.206754 ... 92.627833 92.980765
[31 rows x 5 columns]
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 6 minutes 49.578 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 6 minutes 43.515 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-03-matrix-multiplication-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/d5fee5b55a64e47f1b5724ec39adf171/03-matrix-multiplication.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">03-matrix-multiplication.py</span></code></a></p>