[GH-PAGES] Updated website
This commit is contained in:
@@ -194,36 +194,36 @@ to download the full example code</p>
|
||||
<p class="sphx-glr-script-out">Out:</p>
|
||||
<div class="sphx-glr-script-out highlight-none notranslate"><div class="highlight"><pre><span></span>layer-norm-backward:
|
||||
N Triton Torch Apex
|
||||
0 1024.0 303.407414 98.303995 311.088617
|
||||
1 1536.0 344.523365 132.604320 338.201833
|
||||
0 1024.0 303.407414 98.698793 311.088617
|
||||
1 1536.0 347.773587 133.083026 341.333333
|
||||
2 2048.0 416.542360 157.538467 332.108094
|
||||
3 2560.0 451.764698 181.238943 325.079368
|
||||
4 3072.0 508.468972 190.020625 319.168834
|
||||
5 3584.0 540.981122 206.769233 307.199992
|
||||
6 4096.0 561.737163 219.919464 294.323343
|
||||
7 4608.0 489.345125 230.880998 290.267724
|
||||
8 5120.0 518.481012 242.366855 288.450695
|
||||
9 5632.0 534.260858 243.545956 290.683877
|
||||
10 6144.0 542.117638 249.925419 286.879370
|
||||
3 2560.0 451.764698 181.238943 328.556154
|
||||
4 3072.0 508.468972 190.511624 320.556515
|
||||
5 3584.0 540.981122 206.769233 308.301075
|
||||
6 4096.0 558.545450 219.919464 298.796351
|
||||
7 4608.0 489.345125 231.364016 286.507772
|
||||
8 5120.0 520.677950 242.366855 285.767451
|
||||
9 5632.0 534.260858 243.545956 291.310338
|
||||
10 6144.0 544.118087 249.925419 286.879370
|
||||
11 6656.0 532.479975 254.775119 285.767438
|
||||
12 7168.0 515.065851 252.988236 277.024148
|
||||
13 7680.0 488.912481 265.590783 283.569230
|
||||
14 8192.0 463.698115 257.677592 277.303250
|
||||
14 8192.0 464.794337 257.677592 277.303250
|
||||
15 8704.0 408.798442 266.448988 284.212242
|
||||
16 9216.0 421.302872 271.391419 289.129410
|
||||
17 9728.0 430.760152 278.939059 288.237038
|
||||
18 10240.0 438.074849 286.100109 289.469963
|
||||
19 10752.0 425.821771 245.526173 288.967529
|
||||
20 11264.0 426.397479 244.426754 285.465683
|
||||
21 11776.0 418.082825 248.133438 288.097854
|
||||
16 9216.0 422.106891 271.391419 289.129410
|
||||
17 9728.0 430.760152 279.272720 288.237038
|
||||
18 10240.0 438.074849 286.433562 289.129408
|
||||
19 10752.0 426.525614 245.760009 289.291486
|
||||
20 11264.0 427.071098 244.426754 285.465683
|
||||
21 11776.0 418.082825 248.569911 288.097854
|
||||
22 12288.0 416.542386 253.578674 293.737063
|
||||
23 12800.0 412.348979 252.839495 288.721817
|
||||
24 13312.0 409.862733 251.466350 288.607034
|
||||
23 12800.0 412.348979 253.047766 288.993430
|
||||
24 13312.0 410.125805 251.367424 288.607034
|
||||
25 13824.0 403.130022 256.197690 291.031592
|
||||
26 14336.0 395.021816 254.862216 288.402346
|
||||
27 14848.0 384.829370 256.552919 288.194100
|
||||
28 15360.0 376.547496 257.430175 286.656296
|
||||
29 15872.0 369.474279 260.731015 290.120338
|
||||
26 14336.0 395.021816 255.051144 288.402346
|
||||
27 14848.0 384.829370 256.737757 288.310684
|
||||
28 15360.0 376.547496 257.430175 287.550706
|
||||
29 15872.0 369.832994 260.731015 289.899545
|
||||
</pre></div>
|
||||
</div>
|
||||
<div class="line-block">
|
||||
@@ -477,7 +477,7 @@ to download the full example code</p>
|
||||
<span class="n">bench_layer_norm</span><span class="o">.</span><span class="n">run</span><span class="p">(</span><span class="n">save_path</span><span class="o">=</span><span class="s1">'.'</span><span class="p">,</span> <span class="n">print_data</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
|
||||
</pre></div>
|
||||
</div>
|
||||
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 13.637 seconds)</p>
|
||||
<p class="sphx-glr-timing"><strong>Total running time of the script:</strong> ( 2 minutes 13.777 seconds)</p>
|
||||
<div class="sphx-glr-footer class sphx-glr-footer-example docutils container" id="sphx-glr-download-getting-started-tutorials-05-layer-norm-py">
|
||||
<div class="sphx-glr-download sphx-glr-download-python docutils container">
|
||||
<p><a class="reference download internal" download="" href="../../_downloads/935c0dd0fbeb4b2e69588471cbb2d4b2/05-layer-norm.py"><code class="xref download docutils literal notranslate"><span class="pre">Download</span> <span class="pre">Python</span> <span class="pre">source</span> <span class="pre">code:</span> <span class="pre">05-layer-norm.py</span></code></a></p>
|
||||
|
Reference in New Issue
Block a user