[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-03-01 00:42:45 +00:00
parent 11bcbd3d04
commit 014137b675
156 changed files with 272 additions and 272 deletions

View File

@@ -194,36 +194,36 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
N Triton Torch Apex
0 1024.0 307.200008 99.902435 311.088617
1 1536.0 351.085717 133.083026 341.333333
0 1024.0 307.200008 99.497980 307.200008
1 1536.0 347.773587 133.083026 338.201833
2 2048.0 423.724127 159.067963 321.254900
3 2560.0 451.764698 182.857144 323.368411
4 3072.0 515.580429 191.501303 319.168834
5 3584.0 551.384634 208.271186 310.527060
6 4096.0 568.231237 220.412561 299.707322
7 4608.0 504.986315 232.825259 286.507772
8 5120.0 531.948056 244.294240 286.433562
9 5632.0 542.843364 244.869560 291.939522
10 6144.0 552.269672 251.631408 288.000001
11 6656.0 537.858601 255.590406 286.793541
12 7168.0 516.612607 254.485198 278.368936
13 7680.0 487.619051 266.743841 284.884090
14 8192.0 467.002371 257.003920 276.912679
15 8704.0 418.629245 267.815384 286.158893
16 9216.0 432.000001 273.404206 289.887291
17 9728.0 442.181815 280.615388 289.667485
18 10240.0 448.467168 287.102804 290.840246
19 10752.0 428.651173 246.464170 289.616170
20 11264.0 427.746848 246.432094 286.980888
21 11776.0 421.826879 249.888595 288.981596
22 12288.0 417.131525 254.893699 294.617366
23 12800.0 415.696898 253.674644 290.359162
24 13312.0 410.125805 252.559690 289.653667
25 13824.0 402.640783 257.190689 292.056329
26 14336.0 396.387109 255.240352 289.129416
27 14848.0 383.174202 257.293872 287.844912
28 15360.0 374.253788 258.513318 286.879376
29 15872.0 368.402336 262.347108 290.120338
3 2560.0 451.764698 183.402991 330.322572
4 3072.0 508.468972 193.005236 315.076914
5 3584.0 547.872604 208.271186 308.301075
6 4096.0 564.965515 220.412561 301.546004
7 4608.0 504.986315 232.825259 291.799469
8 5120.0 529.655159 240.941184 285.767451
9 5632.0 547.238891 241.371422 288.820505
10 6144.0 552.269672 249.502530 286.879370
11 6656.0 536.053693 254.369423 284.242007
12 7168.0 515.065851 252.616738 276.134819
13 7680.0 486.332448 263.314295 280.547947
14 8192.0 463.698115 263.196793 280.467910
15 8704.0 416.958106 265.096445 283.440968
16 9216.0 431.157889 271.724806 287.625496
17 9728.0 441.345926 280.615388 288.593329
18 10240.0 446.836366 285.767451 289.469963
19 10752.0 429.364408 246.464170 289.941565
20 11264.0 423.724120 244.869560 284.864065
21 11776.0 421.826879 250.109737 289.573776
22 12288.0 419.504980 253.796902 294.323369
23 12800.0 415.696898 253.256381 287.640454
24 13312.0 409.599999 253.160074 290.707920
25 13824.0 405.098897 256.593977 291.799461
26 14336.0 397.761846 254.673567 287.438588
27 14848.0 381.942121 256.922861 287.612590
28 15360.0 376.932517 259.971797 288.676598
29 15872.0 367.691129 264.717162 292.796308
</pre></div>
</div>
<div class="line-block">
@@ -487,7 +487,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 11.369 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 11.911 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>