[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-02-13 00:39:42 +00:00
parent 2f5658c61f
commit 13537582ad
159 changed files with 303 additions and 303 deletions

View File

@@ -194,36 +194,36 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
N Triton Torch Apex
0 1024.0 311.088617 99.497980 311.088617
1 1536.0 351.085717 133.083026 344.523365
2 2048.0 423.724127 158.554837 332.108094
3 2560.0 458.507457 182.857144 330.322572
4 3072.0 515.580429 191.501303 316.429186
5 3584.0 547.872604 208.271186 311.652167
6 4096.0 568.231237 219.919464 297.890900
7 4608.0 498.162157 232.825259 288.751954
8 5120.0 527.381977 243.809526 289.811322
9 5632.0 540.671974 245.313973 291.310338
10 6144.0 548.163546 251.202731 286.879370
11 6656.0 534.260858 255.590406 286.279570
12 7168.0 516.612607 254.109315 278.368936
13 7680.0 487.619051 266.743841 284.884090
14 8192.0 467.002371 257.003920 276.912679
15 8704.0 416.958106 267.815384 285.767450
16 9216.0 431.157889 273.742580 289.507855
17 9728.0 439.683593 280.278512 289.308559
18 10240.0 446.025405 287.102804 290.840246
19 10752.0 431.518385 246.699797 288.967529
20 11264.0 428.424741 246.432094 286.676558
21 11776.0 422.457417 250.109737 288.981596
22 12288.0 419.504980 254.673582 294.029924
23 12800.0 414.574901 253.884294 289.265522
24 13312.0 412.242569 252.559690 289.653667
25 13824.0 405.594132 257.390218 292.056329
26 14336.0 395.475867 255.240352 289.129416
27 14848.0 385.662341 257.293872 288.077610
28 15360.0 374.443863 258.422707 287.775181
29 15872.0 366.629453 261.986243 290.784741
0 1024.0 311.088617 98.303995 303.407414
1 1536.0 351.085717 134.050910 341.333333
2 2048.0 420.102553 161.684218 325.509933
3 2560.0 461.954908 181.238943 325.079368
4 3072.0 511.999982 192.501302 319.168834
5 3584.0 551.384634 208.271186 311.652167
6 4096.0 568.231237 219.919464 299.707322
7 4608.0 500.416301 232.825259 286.507772
8 5120.0 525.128191 242.366855 285.104413
9 5632.0 540.671974 243.107920 289.438969
10 6144.0 544.118087 248.242431 285.767458
11 6656.0 530.710976 256.000009 285.767438
12 7168.0 505.976473 260.654538 286.242939
13 7680.0 481.253256 262.564106 279.272719
14 8192.0 462.607053 267.130429 284.526763
15 8704.0 417.791980 267.815384 284.987724
16 9216.0 430.319054 272.059034 288.751954
17 9728.0 438.033784 280.278512 289.667485
18 10240.0 447.650282 286.433562 290.496460
19 10752.0 428.651173 247.172406 290.922209
20 11264.0 429.104745 245.536784 286.676558
21 11776.0 422.457417 249.667843 288.686414
22 12288.0 420.102570 254.453844 294.323369
23 12800.0 414.574901 253.465340 289.811310
24 13312.0 412.242569 252.759501 289.916513
25 13824.0 406.090579 257.190689 291.799461
26 14336.0 395.930964 254.297107 286.959121
27 14848.0 386.498925 257.665934 289.246765
28 15360.0 373.495460 257.790220 287.102804
29 15872.0 370.192407 261.626369 289.899545
</pre></div>
</div>
<div class="line-block">
@@ -477,7 +477,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 12.820 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 12.343 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>