[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-05-04 00:43:20 +00:00
parent af77440e1b
commit d420763e0b
158 changed files with 290 additions and 290 deletions

View File

@@ -194,36 +194,36 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
N Triton Torch Apex
0 1024.0 307.200008 97.912354 303.407414
1 1536.0 351.085717 134.050910 341.333333
2 2048.0 423.724127 160.627450 334.367350
3 2560.0 461.954908 180.705883 330.322572
4 3072.0 515.580429 191.999993 323.368415
5 3584.0 551.384634 207.768111 310.527060
6 4096.0 568.231237 220.412561 297.890900
7 4608.0 498.162157 232.336141 287.251954
8 5120.0 525.128191 242.366855 284.444444
9 5632.0 538.517949 243.107920 289.438969
10 6144.0 542.117638 248.661056 286.879370
11 6656.0 528.953642 255.590406 285.257135
12 7168.0 505.976473 260.063480 284.821192
13 7680.0 485.052616 262.751252 280.121579
14 8192.0 460.440290 266.406514 284.526763
15 8704.0 416.127506 267.472468 284.987724
16 9216.0 429.483477 271.724806 288.375482
17 9728.0 437.213490 280.615388 289.667485
18 10240.0 446.025405 286.433562 289.811322
19 10752.0 429.364408 246.699797 290.267711
20 11264.0 429.104745 245.536784 286.980888
21 11776.0 423.089806 249.667843 288.981596
22 12288.0 418.909088 254.453844 294.911986
23 12800.0 414.016170 253.884294 288.180121
24 13312.0 411.711355 253.160074 290.443638
25 13824.0 406.090579 257.390218 292.056329
26 14336.0 396.387109 255.051144 287.198654
27 14848.0 386.498925 257.665934 289.717061
28 15360.0 376.163261 257.790220 288.000007
29 15872.0 368.046389 261.626369 290.562936
0 1024.0 311.088617 99.497980 311.088617
1 1536.0 354.461542 133.565214 341.333333
2 2048.0 423.724127 159.067963 321.254900
3 2560.0 461.954908 182.314537 326.808501
4 3072.0 519.211251 191.005181 321.956335
5 3584.0 551.384634 208.271186 308.301075
6 4096.0 568.231237 220.907859 300.623865
7 4608.0 498.162157 232.336141 287.999990
8 5120.0 525.128191 241.414550 285.104413
9 5632.0 538.517949 242.671458 288.204696
10 6144.0 546.133354 251.631408 288.563606
11 6656.0 534.260858 255.590406 284.242007
12 7168.0 508.970395 255.619613 278.820105
13 7680.0 485.052616 264.827585 281.404588
14 8192.0 461.521112 267.493874 282.077471
15 8704.0 416.958106 263.093202 281.152082
16 9216.0 431.157889 271.724806 289.129410
17 9728.0 438.857162 282.653752 291.840007
18 10240.0 446.836366 285.104413 288.112552
19 10752.0 432.966444 245.059832 287.999996
20 11264.0 429.104745 243.107920 285.465683
21 11776.0 421.826879 250.331271 288.981596
22 12288.0 420.701865 254.673582 294.911986
23 12800.0 414.016170 254.515329 288.993430
24 13312.0 412.242569 252.161013 289.653667
25 13824.0 403.130022 257.790206 293.347481
26 14336.0 396.844280 254.109315 286.959121
27 14848.0 383.999990 257.108233 288.777966
28 15360.0 376.932517 261.261510 289.129401
29 15872.0 368.046389 261.986243 291.229369
</pre></div>
</div>
<div class="line-block">
@@ -477,7 +477,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 11.085 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 12.419 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>