[GH-PAGES] Updated website

Philippe Tillet
2022-02-13 00:39:42 +00:00
parent 2f5658c61f
commit 13537582ad
159 changed files with 303 additions and 303 deletions

@@ -194,36 +194,36 @@ to download the full example code</p>
 <p class="sphx-glr-script-out">Out:</p>
 <div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
 N Triton Torch Apex
-0 1024.0 307.200008 97.912354 303.407414
+0 1024.0 311.088617 98.303995 307.200008
 1 1536.0 347.773587 134.540150 341.333333
-2 2048.0 420.102553 161.684218 323.368435
-3 2560.0 458.507457 181.238943 330.322572
+2 2048.0 420.102553 161.684218 334.367350
+3 2560.0 458.507457 181.775141 330.322572
 4 3072.0 511.999982 192.501302 320.556515
-5 3584.0 547.872604 208.271186 312.785456
-6 4096.0 568.231237 220.907859 300.623865
-7 4608.0 507.302750 232.825259 287.999990
-8 5120.0 527.381977 242.845844 285.104413
-9 5632.0 540.671974 241.371422 288.204696
-10 6144.0 548.163546 250.349744 287.438593
-11 6656.0 534.260858 256.000009 286.279570
-12 7168.0 512.000004 255.240352 280.182402
-13 7680.0 485.052616 263.690977 277.172933
-14 8192.0 463.698115 268.223740 281.673345
-15 8704.0 418.629245 266.109560 282.291896
-16 9216.0 432.845409 272.394084 288.375482
-17 9728.0 439.683593 278.606213 287.173424
-18 10240.0 446.025405 286.767793 288.112552
-19 10752.0 423.724151 244.827326 288.321786
-20 11264.0 426.397479 245.983625 287.285864
-21 11776.0 421.198220 247.807112 287.219500
-22 12288.0 420.701865 254.453844 294.911986
-23 12800.0 413.458944 252.009851 287.910035
-24 13312.0 411.181478 253.763296 290.972683
-25 13824.0 403.620451 258.191439 292.829653
-26 14336.0 394.116833 255.240352 289.372589
-27 14848.0 385.245405 256.552919 289.952797
-28 15360.0 379.649845 262.751252 289.811315
-29 15872.0 370.913333 261.806182 289.899545
+5 3584.0 547.872604 208.271186 311.652167
+6 4096.0 568.231237 220.412561 297.890900
+7 4608.0 504.986315 232.825259 286.507772
+8 5120.0 529.655159 242.845844 285.104413
+9 5632.0 545.032265 243.545956 289.438969
+10 6144.0 548.163546 248.661056 285.767458
+11 6656.0 534.260858 256.000009 285.767438
+12 7168.0 507.469040 260.457220 286.242939
+13 7680.0 481.253256 262.190612 275.104486
+14 8192.0 462.607053 267.130429 284.939124
+15 8704.0 417.791980 267.815384 284.599455
+16 9216.0 431.157889 272.394084 288.751954
+17 9728.0 438.857162 280.615388 290.027323
+18 10240.0 449.287041 286.433562 287.438599
+19 10752.0 427.231788 247.172406 290.594591
+20 11264.0 427.071098 245.760001 286.676558
+21 11776.0 422.457417 249.667843 288.686414
+22 12288.0 419.504980 254.453844 294.029924
+23 12800.0 414.016170 253.256381 289.538159
+24 13312.0 411.181478 252.759501 289.916513
+25 13824.0 404.112047 257.190689 292.056329
+26 14336.0 393.215988 254.485198 286.719986
+27 14848.0 385.245405 257.665934 289.246765
+28 15360.0 373.495460 257.970599 287.102804
+29 15872.0 371.637071 261.806182 289.899545
 </pre></div>
 </div>
 <div class="line-block">
@@ -487,7 +487,7 @@ to download the full example code</p>
 <span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
 </pre></div>
 </div>
-<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 12.137 seconds)</p>
+<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 12.324 seconds)</p>
 <div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
 <div class="sphx-glr-download sphx-glr-download-python docutils container">
 <p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>