[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-03-03 00:42:21 +00:00
parent 014137b675
commit 80b92a0d2d
158 changed files with 272 additions and 272 deletions

View File

@@ -194,36 +194,36 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
N Triton Torch Apex
0 1024.0 307.200008 99.497980 307.200008
1 1536.0 347.773587 133.083026 338.201833
2 2048.0 423.724127 159.067963 321.254900
3 2560.0 451.764698 183.402991 330.322572
4 3072.0 508.468972 193.005236 315.076914
5 3584.0 547.872604 208.271186 308.301075
6 4096.0 564.965515 220.412561 301.546004
0 1024.0 311.088617 98.303995 303.407414
1 1536.0 347.773587 134.050910 341.333333
2 2048.0 420.102553 161.684218 325.509933
3 2560.0 455.111129 181.238943 326.808501
4 3072.0 511.999982 191.999993 317.793096
5 3584.0 547.872604 207.768111 310.527060
6 4096.0 564.965515 219.919464 295.207204
7 4608.0 504.986315 232.825259 291.799469
8 5120.0 529.655159 240.941184 285.767451
9 5632.0 547.238891 241.371422 288.820505
10 6144.0 552.269672 249.502530 286.879370
11 6656.0 536.053693 254.369423 284.242007
12 7168.0 515.065851 252.616738 276.134819
13 7680.0 486.332448 263.314295 280.547947
14 8192.0 463.698115 263.196793 280.467910
15 8704.0 416.958106 265.096445 283.440968
16 9216.0 431.157889 271.724806 287.625496
17 9728.0 441.345926 280.615388 288.593329
18 10240.0 446.836366 285.767451 289.469963
19 10752.0 429.364408 246.464170 289.941565
20 11264.0 423.724120 244.869560 284.864065
21 11776.0 421.826879 250.109737 289.573776
22 12288.0 419.504980 253.796902 294.323369
23 12800.0 415.696898 253.256381 287.640454
24 13312.0 409.599999 253.160074 290.707920
25 13824.0 405.098897 256.593977 291.799461
26 14336.0 397.761846 254.673567 287.438588
27 14848.0 381.942121 256.922861 287.612590
28 15360.0 376.932517 259.971797 288.676598
29 15872.0 367.691129 264.717162 292.796308
8 5120.0 527.381977 242.845844 288.450695
9 5632.0 542.843364 243.107920 288.820505
10 6144.0 546.133354 248.661056 286.879370
11 6656.0 532.479975 256.000009 285.767438
12 7168.0 505.976473 260.654538 286.242939
13 7680.0 481.253256 262.190612 278.429013
14 8192.0 463.698115 267.130429 284.939124
15 8704.0 417.791980 267.815384 284.987724
16 9216.0 432.845409 272.394084 288.751954
17 9728.0 439.683593 280.278512 289.667485
18 10240.0 448.467168 286.433562 290.153487
19 10752.0 425.821771 247.172406 290.922209
20 11264.0 427.071098 245.760001 286.676558
21 11776.0 423.724129 249.667843 288.981596
22 12288.0 419.504980 254.673582 294.323369
23 12800.0 413.458944 253.674644 288.180121
24 13312.0 412.242569 252.859526 289.916513
25 13824.0 405.594132 257.190689 292.056329
26 14336.0 394.568805 254.297107 286.719986
27 14848.0 386.498925 257.665934 289.246765
28 15360.0 373.117425 257.970599 286.211174
29 15872.0 371.274849 261.806182 289.899545
</pre></div>
</div>
<div class="line-block">
@@ -487,7 +487,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 11.911 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 12.550 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>