[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-04-27 00:47:37 +00:00
parent b0a569b724
commit 9bf9b2f3d8
158 changed files with 302 additions and 302 deletions

View File

@@ -194,36 +194,36 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
N Triton Torch Apex
0 1024.0 311.088617 98.698793 299.707322
1 1536.0 351.085717 133.083026 338.201833
2 2048.0 420.102553 162.217818 325.509933
3 2560.0 461.954908 183.402991 326.808501
4 3072.0 511.999982 192.501302 315.076914
5 3584.0 547.872604 207.768111 307.199992
6 4096.0 561.737163 221.405403 294.323343
7 4608.0 495.928261 232.336141 294.127651
8 5120.0 520.677950 243.326731 287.102804
9 5632.0 534.260858 241.371422 286.980888
10 6144.0 540.131844 250.349744 287.438593
11 6656.0 527.207907 254.775119 284.748652
12 7168.0 507.469040 254.109315 278.820105
13 7680.0 482.513091 263.690977 280.975614
14 8192.0 459.364487 265.327937 280.467910
15 8704.0 415.300208 264.091015 281.530996
16 9216.0 429.483477 270.065931 287.625496
17 9728.0 435.582084 280.278512 289.667485
18 10240.0 447.650282 285.104413 289.129408
19 10752.0 432.241202 245.994291 289.616170
20 11264.0 428.424741 242.671458 283.371073
21 11776.0 423.089806 248.460649 288.391833
22 12288.0 418.909088 253.796902 294.323369
23 12800.0 415.135142 253.465340 289.265522
24 13312.0 411.711355 252.959629 291.769875
25 13824.0 403.620451 257.190689 291.799461
26 14336.0 399.146178 253.734520 287.919661
27 14848.0 386.498925 258.039095 290.662311
28 15360.0 377.318326 260.338991 288.450715
29 15872.0 367.691129 263.071829 291.675334
0 1024.0 307.200008 99.497980 315.076934
1 1536.0 347.773587 133.083026 344.523365
2 2048.0 423.724127 158.045011 334.367350
3 2560.0 461.954908 182.314537 330.322572
4 3072.0 515.580429 191.501303 316.429186
5 3584.0 551.384634 207.768111 310.527060
6 4096.0 564.965515 219.919464 298.796351
7 4608.0 498.162157 231.849059 290.267724
8 5120.0 529.655159 243.809526 289.811322
9 5632.0 540.671974 244.426754 291.310338
10 6144.0 546.133354 251.202731 288.000001
11 6656.0 534.260858 256.000009 286.793541
12 7168.0 516.612607 253.734520 277.470965
13 7680.0 490.212752 266.358392 284.444450
14 8192.0 463.698115 258.694729 277.694924
15 8704.0 416.127506 267.130429 285.767450
16 9216.0 430.319054 273.066667 289.129410
17 9728.0 439.683593 280.278512 289.308559
18 10240.0 446.836366 286.767793 287.438599
19 10752.0 432.241202 246.699797 289.941565
20 11264.0 429.104745 246.432094 287.897767
21 11776.0 422.457417 249.667843 289.573776
22 12288.0 420.102570 254.673582 294.323369
23 12800.0 414.016170 253.884294 290.359162
24 13312.0 410.652963 253.360814 289.391298
25 13824.0 406.588243 257.190689 292.056329
26 14336.0 395.475867 255.619613 288.402346
27 14848.0 385.245405 257.015513 288.544136
28 15360.0 376.932517 258.513318 288.902809
29 15872.0 369.474279 261.626369 289.679087
</pre></div>
</div>
<div class="line-block">
@@ -477,7 +477,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 11.756 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 10.422 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>