[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-04-08 00:44:05 +00:00
parent 80b92a0d2d
commit 0c570c178d
173 changed files with 401 additions and 386 deletions

View File

@@ -194,36 +194,36 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
N Triton Torch Apex
0 1024.0 311.088617 98.303995 303.407414
1 1536.0 347.773587 134.050910 341.333333
2 2048.0 420.102553 161.684218 325.509933
3 2560.0 455.111129 181.238943 326.808501
4 3072.0 511.999982 191.999993 317.793096
5 3584.0 547.872604 207.768111 310.527060
6 4096.0 564.965515 219.919464 295.207204
7 4608.0 504.986315 232.825259 291.799469
8 5120.0 527.381977 242.845844 288.450695
9 5632.0 542.843364 243.107920 288.820505
10 6144.0 546.133354 248.661056 286.879370
11 6656.0 532.479975 256.000009 285.767438
12 7168.0 505.976473 260.654538 286.242939
13 7680.0 481.253256 262.190612 278.429013
14 8192.0 463.698115 267.130429 284.939124
15 8704.0 417.791980 267.815384 284.987724
16 9216.0 432.845409 272.394084 288.751954
17 9728.0 439.683593 280.278512 289.667485
18 10240.0 448.467168 286.433562 290.153487
19 10752.0 425.821771 247.172406 290.922209
20 11264.0 427.071098 245.760001 286.676558
21 11776.0 423.724129 249.667843 288.981596
22 12288.0 419.504980 254.673582 294.323369
23 12800.0 413.458944 253.674644 288.180121
24 13312.0 412.242569 252.859526 289.916513
25 13824.0 405.594132 257.190689 292.056329
26 14336.0 394.568805 254.297107 286.719986
27 14848.0 386.498925 257.665934 289.246765
28 15360.0 373.117425 257.970599 286.211174
29 15872.0 371.274849 261.806182 289.899545
0 1024.0 356.173905 99.497980 315.076934
1 1536.0 405.098894 134.050910 344.523365
2 2048.0 486.653476 159.067963 321.254900
3 2560.0 458.507457 182.314537 326.808501
4 3072.0 515.580429 191.501303 319.168834
5 3584.0 551.384634 207.768111 307.199992
6 4096.0 568.231237 220.907859 293.444785
7 4608.0 502.690905 232.336141 290.267724
8 5120.0 527.381977 243.326731 287.102804
9 5632.0 540.671974 244.426754 291.310338
10 6144.0 548.163546 251.202731 288.000001
11 6656.0 532.479975 255.590406 286.279570
12 7168.0 510.480705 253.734520 277.919225
13 7680.0 487.619051 266.743841 284.884090
14 8192.0 468.114289 258.694729 278.481578
15 8704.0 415.300208 267.472468 284.987724
16 9216.0 429.483477 272.394084 290.077383
17 9728.0 438.033784 280.278512 288.950501
18 10240.0 443.610086 287.102804 290.153487
19 10752.0 426.525614 246.699797 290.267711
20 11264.0 427.071098 245.536784 286.069848
21 11776.0 418.702211 249.447482 288.981596
22 12288.0 414.784810 254.673582 294.323369
23 12800.0 410.146863 254.094291 288.180121
24 13312.0 409.599999 252.161013 289.129403
25 13824.0 404.112047 257.190689 291.799461
26 14336.0 395.930964 256.000002 289.129416
27 14848.0 385.662341 257.479779 288.777966
28 15360.0 380.433442 258.332158 286.656296
29 15872.0 372.363640 261.806182 290.562936
</pre></div>
</div>
<div class="line-block">
@@ -487,7 +487,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 12.550 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 14.583 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>