[GH-PAGES] Updated website

This commit is contained in:
Philippe Tillet
2022-08-06 00:49:10 +00:00
parent 844e79e14c
commit 73ee4b1d0d
167 changed files with 288 additions and 288 deletions

View File

@@ -196,35 +196,35 @@ to download the full example code</p>
<p class="sphx-glr-script-out">Out:</p>
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm:
N Triton Torch Apex
0 1024.0 606.814814 277.694907 468.114273
0 1024.0 585.142849 277.694907 468.114273
1 1536.0 630.153868 323.368435 511.999982
2 2048.0 682.666643 334.367358 520.126988
3 2560.0 694.237267 365.714281 512.000013
4 3072.0 712.347810 378.092307 496.484863
5 3584.0 725.873439 384.859062 451.527536
6 4096.0 728.177767 381.023256 455.111095
7 4608.0 670.254540 394.267384 426.173427
8 5120.0 688.403381 397.669909 422.268057
9 5632.0 704.000002 395.228063 415.262685
2 2048.0 682.666643 337.814445 520.126988
3 2560.0 694.237267 365.714281 518.481028
4 3072.0 712.347810 375.206126 496.484863
5 3584.0 725.873439 384.859062 455.111115
6 4096.0 728.177767 381.023256 442.810792
7 4608.0 670.254540 396.387087 426.173427
8 5120.0 688.403381 397.669909 426.666652
9 5632.0 698.542675 398.725657 411.470331
10 6144.0 697.191505 402.885254 409.600010
11 6656.0 700.631610 400.360920 400.360920
12 7168.0 690.891575 396.844306 387.459443
13 7680.0 678.895043 393.846167 386.415087
14 8192.0 636.271854 393.609605 371.308771
15 8704.0 627.315309 389.005597 381.545190
16 9216.0 606.814809 407.337026 383.999986
17 9728.0 587.350922 409.599987 383.369452
18 10240.0 564.965524 408.578556 382.803739
19 10752.0 547.872604 411.559798 381.445676
20 11264.0 533.207081 406.826188 373.134567
21 11776.0 520.486200 409.599991 377.587162
22 12288.0 514.680630 413.911572 383.251457
23 12800.0 504.433489 410.420828 376.470582
24 13312.0 494.180982 405.699062 376.976995
25 13824.0 482.934503 411.888257 379.389355
26 14336.0 471.967074 406.695045 374.185964
27 14848.0 461.297068 408.192434 375.304904
28 15360.0 454.269882 406.214870 378.092307
11 6656.0 700.631610 400.360920 398.861429
12 7168.0 690.891575 382.293315 382.293315
13 7680.0 678.895043 392.587863 386.415087
14 8192.0 636.271854 392.431125 374.491442
15 8704.0 624.502255 392.292962 380.502740
16 9216.0 606.814809 403.989025 383.002605
17 9728.0 587.350922 407.455499 382.427505
18 10240.0 566.920437 407.562184 381.911416
19 10752.0 547.872604 410.577576 380.601764
20 11264.0 533.207081 396.096702 369.311483
21 11776.0 521.927959 407.826843 377.587162
22 12288.0 516.031509 413.042029 382.505826
23 12800.0 504.433489 408.782457 376.470582
24 13312.0 494.180982 401.871683 375.647260
25 13824.0 482.934503 409.600016 378.092325
26 14336.0 471.967074 398.914774 372.969090
27 14848.0 461.297068 403.341254 374.712936
28 15360.0 454.269882 406.887417 378.092307
29 15872.0 447.887117 406.974373 376.225175
</pre></div>
</div>
@@ -543,7 +543,7 @@ to download the full example code</p>
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">&#39;.&#39;</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
</pre></div>
</div>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 5 minutes 38.270 seconds)</p>
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 5 minutes 38.714 seconds)</p>
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
<div class="sphx-glr-download sphx-glr-download-python docutils container">
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>