[GH-PAGES] Updated website

Philippe Tillet
2022-02-17 00:40:30 +00:00
parent c1c43fcf06
commit 0cd3b626c5
158 changed files with 258 additions and 258 deletions

@@ -194,36 +194,36 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
N Triton Torch Apex
- 0 1024.0 311.088617 98.303995 303.407414
- 1 1536.0 351.085717 133.565214 341.333333
- 2 2048.0 420.102553 162.217818 327.679984
- 3 2560.0 465.454542 182.314537 332.108113
- 4 3072.0 511.999982 191.999993 317.793096
- 5 3584.0 551.384634 207.768111 309.410081
- 6 4096.0 568.231237 220.412561 300.623865
- 7 4608.0 500.416301 233.316456 290.267724
- 8 5120.0 525.128191 242.845844 287.102804
- 9 5632.0 538.517949 243.107920 289.438969
- 10 6144.0 544.118087 250.775512 287.438593
- 11 6656.0 536.053693 255.590406 285.257135
- 12 7168.0 512.000004 256.381525 280.639473
- 13 7680.0 486.332448 264.447629 279.272719
- 14 8192.0 464.794337 268.223740 283.296835
- 15 8704.0 416.958106 264.091015 280.021457
- 16 9216.0 430.319054 272.729961 289.887291
- 17 9728.0 438.857162 281.630872 289.667485
- 18 10240.0 446.836366 286.100109 288.112552
- 19 10752.0 426.525614 247.884724 292.240100
- 20 11264.0 431.157894 243.765566 283.668421
- 21 11776.0 423.724129 249.888595 288.391833
- 22 12288.0 420.701865 253.796902 294.617366
- 23 12800.0 414.574901 254.515329 290.359162
- 24 13312.0 411.181478 253.360814 289.653667
- 25 13824.0 407.587209 256.991469 291.543045
- 26 14336.0 395.930964 254.109315 286.959121
- 27 14848.0 384.414233 258.976744 289.717061
- 28 15360.0 377.704925 260.155264 290.039336
- 29 15872.0 369.116300 262.708969 291.452168
+ 0 1024.0 311.088617 98.303995 307.200008
+ 1 1536.0 354.461542 134.540150 344.523365
+ 2 2048.0 423.724127 161.684218 334.367350
+ 3 2560.0 465.454542 181.775141 330.322572
+ 4 3072.0 515.580429 192.501302 320.556515
+ 5 3584.0 551.384634 208.271186 311.652167
+ 6 4096.0 568.231237 220.412561 298.796351
+ 7 4608.0 500.416301 232.825259 287.251954
+ 8 5120.0 525.128191 242.845844 285.104413
+ 9 5632.0 542.843364 243.545956 289.438969
+ 10 6144.0 544.118087 248.242431 286.322318
+ 11 6656.0 532.479975 256.000009 286.279570
+ 12 7168.0 505.976473 260.654538 286.242939
+ 13 7680.0 481.253256 262.564106 278.850215
+ 14 8192.0 463.698115 267.130429 284.939124
+ 15 8704.0 417.374651 267.815384 284.987724
+ 16 9216.0 430.319054 272.394084 288.751954
+ 17 9728.0 438.857162 280.278512 289.667485
+ 18 10240.0 447.650282 286.433562 290.840246
+ 19 10752.0 428.651173 247.172406 290.922209
+ 20 11264.0 429.786952 245.760001 286.676558
+ 21 11776.0 422.457417 249.888595 288.981596
+ 22 12288.0 420.102570 254.673582 294.617366
+ 23 12800.0 414.574901 253.674644 288.450715
+ 24 13312.0 412.242569 252.659556 289.916513
+ 25 13824.0 406.090579 257.390218 292.056329
+ 26 14336.0 396.387109 254.297107 286.959121
+ 27 14848.0 386.498925 257.665934 289.481735
+ 28 15360.0 373.495460 257.970599 287.550706
+ 29 15872.0 370.192407 261.806182 289.899545
</pre></div>
</div>
<div class="line-block">
@@ -477,7 +477,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
- <p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 12.467 seconds)</p>
+ <p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 13.083 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>