[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-02-12 00:39:34 +00:00
parent 7c47f3e043
commit 2f5658c61f
158 changed files with 296 additions and 296 deletions

View File

@@ -564,42 +564,42 @@ torch_output=tensor([[ 1.1045, -36.9688, 31.4688, ..., -11.3906, 24.4531, -3
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>matmul-performance:
M cuBLAS ... Triton Triton (+ LeakyReLU)
0 256.0 2.730667 ... 3.276800 2.978909
0 256.0 2.978909 ... 2.978909 2.978909
1 384.0 7.372800 ... 8.507077 7.899428
2 512.0 14.563555 ... 16.384000 16.384000
3 640.0 22.260869 ... 24.380953 24.380953
3 640.0 23.272727 ... 24.380953 24.380953
4 768.0 32.768000 ... 34.028308 34.028308
5 896.0 39.025776 ... 39.025776 39.025776
5 896.0 37.971025 ... 40.140799 39.025776
6 1024.0 49.932191 ... 52.428801 52.428801
7 1152.0 45.242181 ... 46.656000 46.656000
8 1280.0 51.200001 ... 56.888887 56.888887
9 1408.0 64.138541 ... 67.305878 66.485074
10 1536.0 79.526831 ... 79.526831 79.526831
11 1664.0 62.929456 ... 62.492442 62.061463
12 1792.0 72.983276 ... 72.047592 72.047592
13 1920.0 68.776119 ... 70.172588 70.172588
14 2048.0 73.262953 ... 76.608294 76.260072
15 2176.0 83.155572 ... 85.998493 85.632545
16 2304.0 68.643310 ... 76.319081 75.834511
17 2432.0 71.125224 ... 74.521127 84.621881
18 2560.0 77.833728 ... 81.108913 81.108913
19 2688.0 83.186525 ... 89.888756 89.464755
20 2816.0 83.074685 ... 82.916747 81.293956
21 2944.0 81.967162 ... 82.237674 81.298583
22 3072.0 81.238312 ... 88.612060 88.890270
23 3200.0 84.656085 ... 95.665176 94.674553
24 3328.0 83.468170 ... 82.843841 84.695641
25 3456.0 81.683457 ... 90.281712 90.994998
26 3584.0 84.986191 ... 87.296493 95.047985
27 3712.0 82.355598 ... 88.718781 83.040189
28 3840.0 84.228485 ... 91.701494 85.465227
29 3968.0 91.062642 ... 85.271796 89.394823
30 4096.0 87.352901 ... 88.011627 87.154371
10 1536.0 80.430545 ... 79.526831 78.643199
11 1664.0 62.929456 ... 62.492442 62.492442
12 1792.0 72.512412 ... 72.047592 72.047592
13 1920.0 68.776119 ... 70.172588 70.530615
14 2048.0 73.584279 ... 76.608294 76.608294
15 2176.0 83.500614 ... 86.367588 85.998493
16 2304.0 68.643310 ... 76.809875 76.563695
17 2432.0 71.487187 ... 74.719317 84.621881
18 2560.0 77.283019 ... 81.108913 81.108913
19 2688.0 83.369354 ... 89.464755 89.676257
20 2816.0 81.981598 ... 83.392363 82.916747
21 2944.0 81.967162 ... 81.034195 81.832567
22 3072.0 82.301023 ... 87.924073 88.612060
23 3200.0 78.816219 ... 94.814812 95.380032
24 3328.0 84.101981 ... 82.275764 85.500351
25 3456.0 81.849303 ... 83.893412 90.281712
26 3584.0 87.211821 ... 98.537414 90.367227
27 3712.0 80.627396 ... 87.132441 87.018592
28 3840.0 84.940091 ... 92.236860 84.548438
29 3968.0 92.302520 ... 84.154440 90.724116
30 4096.0 86.478753 ... 90.169784 87.097813
[31 rows x 5 columns]
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 5 minutes 55.972 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 5 minutes 57.081 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-03-matrix-multiplication-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/d5fee5b55a64e47f1b5724ec39adf171/03-matrix-multiplication.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">03-matrix-multiplication.py</span></code></a></p>