[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-04-24 00:44:07 +00:00
parent 31dd4ab60e
commit 1581cf9d79
158 changed files with 328 additions and 328 deletions

View File

@@ -194,36 +194,36 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
N Triton Torch Apex
0 1024.0 303.407414 98.698793 307.200008
1 1536.0 344.523365 134.050910 341.333333
2 2048.0 420.102553 160.104230 321.254900
3 2560.0 455.111129 181.775141 326.808501
4 3072.0 498.162140 190.511624 313.736171
5 3584.0 540.981122 205.779899 310.527060
6 4096.0 558.545450 220.412561 300.623865
7 4608.0 489.345125 231.849059 287.999990
8 5120.0 520.677950 241.889751 285.104413
9 5632.0 530.070605 240.941167 287.591490
10 6144.0 538.160602 249.502530 286.322318
11 6656.0 532.479975 254.369423 285.257135
12 7168.0 513.528374 254.109315 278.368936
13 7680.0 486.332448 264.068761 282.699379
14 8192.0 460.440290 269.695465 282.889211
15 8704.0 408.000001 261.774429 281.530996
16 9216.0 420.501910 271.724806 289.507855
17 9728.0 429.176463 279.607181 288.950501
18 10240.0 439.642212 284.774046 288.789653
19 10752.0 424.421071 245.760009 290.267711
20 11264.0 425.725982 241.587127 284.564206
21 11776.0 418.702211 249.227509 288.391833
22 12288.0 420.701865 252.709503 292.571431
23 12800.0 413.458944 253.465340 288.180121
24 13312.0 411.181478 250.972500 289.129403
25 13824.0 403.620451 256.593977 291.543045
26 14336.0 397.761846 253.360829 287.438588
27 14848.0 386.080180 259.353715 291.137253
28 15360.0 378.480483 259.240506 288.676598
29 15872.0 366.629453 260.196726 289.019722
0 1024.0 307.200008 97.912354 299.707322
1 1536.0 347.773587 134.050910 338.201833
2 2048.0 423.724127 161.154101 323.368435
3 2560.0 465.454542 180.705883 326.808501
4 3072.0 511.999982 192.501302 320.556515
5 3584.0 551.384634 208.271186 309.410081
6 4096.0 568.231237 220.412561 291.703260
7 4608.0 495.928261 232.825259 290.267724
8 5120.0 525.128191 242.366855 284.444444
9 5632.0 538.517949 243.107920 288.820505
10 6144.0 542.117638 248.661056 286.879370
11 6656.0 528.953642 255.590406 285.257135
12 7168.0 505.976473 260.260201 285.293536
13 7680.0 485.052616 262.938666 280.121579
14 8192.0 460.440290 266.046015 284.526763
15 8704.0 416.127506 267.472468 284.987724
16 9216.0 429.483477 271.391419 288.375482
17 9728.0 437.213490 280.278512 290.027323
18 10240.0 446.025405 286.100109 289.811322
19 10752.0 430.079980 246.935876 290.267711
20 11264.0 429.786952 245.536784 286.980888
21 11776.0 423.089806 249.667843 289.277383
22 12288.0 419.504980 254.453844 294.617366
23 12800.0 414.016170 253.884294 288.180121
24 13312.0 412.242569 252.959629 290.443638
25 13824.0 406.090579 257.390218 292.056329
26 14336.0 396.158905 254.862216 287.198654
27 14848.0 386.498925 257.665934 289.481735
28 15360.0 376.163261 257.790220 286.656296
29 15872.0 368.402336 261.626369 290.562936
</pre></div>
</div>
<div class="line-block">
@@ -477,7 +477,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 13.422 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 11.521 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>