[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-04-16 00:44:44 +00:00
parent 824d060dfb
commit 9b0ee317d9
160 changed files with 365 additions and 380 deletions

View File

@@ -194,36 +194,36 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
N Triton Torch Apex
0 1024.0 356.173905 98.303995 307.200008
1 1536.0 396.387087 133.565214 341.333333
2 2048.0 481.882362 160.627450 325.509933
3 2560.0 451.764698 180.175950 321.675394
4 3072.0 511.999982 189.046153 316.429186
5 3584.0 547.872604 206.769233 308.301075
6 4096.0 558.545450 218.939860 298.796351
7 4608.0 491.520008 231.849059 286.507772
8 5120.0 518.481012 240.469672 283.133649
9 5632.0 532.157453 241.371422 288.204696
10 6144.0 542.117638 249.502530 286.322318
11 6656.0 532.479975 253.561895 284.242007
12 7168.0 507.469040 254.109315 277.919225
13 7680.0 486.332448 263.314295 280.547947
14 8192.0 464.794337 263.903346 277.694924
15 8704.0 406.412440 263.093202 280.774186
16 9216.0 418.909088 270.065931 286.507772
17 9728.0 427.604376 281.291575 289.667485
18 10240.0 434.973455 284.115604 288.450695
19 10752.0 423.724151 244.827326 289.291486
20 11264.0 423.061049 242.019694 282.482755
21 11776.0 417.465304 247.915800 287.219500
22 12288.0 414.202242 252.601276 293.737063
23 12800.0 410.146863 252.424003 288.993430
24 13312.0 406.991092 252.759501 289.916513
25 13824.0 404.112047 255.408777 291.031592
26 14336.0 395.475867 251.692749 284.821192
27 14848.0 383.174202 255.816222 287.612590
28 15360.0 378.480483 259.058326 289.129401
29 15872.0 369.832994 260.196726 288.800600
0 1024.0 356.173905 99.497980 315.076934
1 1536.0 409.599994 134.050910 344.523365
2 2048.0 491.520012 159.067963 321.254900
3 2560.0 461.954908 182.314537 325.079368
4 3072.0 519.211251 191.501303 320.556515
5 3584.0 554.941930 207.768111 309.410081
6 4096.0 564.965515 220.907859 300.623865
7 4608.0 500.416301 232.336141 287.251954
8 5120.0 529.655159 243.809526 289.129408
9 5632.0 540.671974 244.426754 291.310338
10 6144.0 552.269672 251.202731 288.000001
11 6656.0 534.260858 255.590406 286.279570
12 7168.0 512.000004 253.734520 277.919225
13 7680.0 487.619051 266.743841 284.884090
14 8192.0 468.114289 258.354805 278.481578
15 8704.0 415.300208 267.472468 285.377055
16 9216.0 428.651187 272.394084 289.887291
17 9728.0 438.033784 279.942444 288.950501
18 10240.0 445.217381 287.102804 290.153487
19 10752.0 427.231788 246.935876 289.941565
20 11264.0 428.424741 245.536784 286.069848
21 11776.0 418.702211 249.667843 288.981596
22 12288.0 414.784810 254.453844 294.323369
23 12800.0 410.695192 254.094291 288.180121
24 13312.0 410.125805 252.559690 289.129403
25 13824.0 404.604870 256.991469 291.799461
26 14336.0 396.387109 255.809666 288.886653
27 14848.0 386.498925 257.665934 288.777966
28 15360.0 378.869469 258.513318 286.656296
29 15872.0 372.000001 261.626369 290.562936
</pre></div>
</div>
<div class="line-block">
@@ -487,7 +487,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 15.414 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 14.228 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>