Computation timesΒΆ
16:48.611 total execution time for getting-started_tutorials files:
Matrix Multiplication ( |
06:21.317 |
0.0 MB |
Layer Normalization ( |
05:28.933 |
0.0 MB |
Fused Softmax ( |
03:26.653 |
0.0 MB |
Vector Addition ( |
01:31.609 |
0.0 MB |
Fused Attention ( |
00:00.076 |
0.0 MB |
Low-Memory Dropout ( |
00:00.012 |
0.0 MB |
Libdevice function ( |
00:00.011 |
0.0 MB |