Computation timesΒΆ
18:18.301 total execution time for getting-started_tutorials files:
Matrix Multiplication ( |
07:14.455 |
0.0 MB |
Layer Normalization ( |
05:40.298 |
0.0 MB |
Fused Softmax ( |
03:30.958 |
0.0 MB |
Vector Addition ( |
01:51.976 |
0.0 MB |
Low-Memory Dropout ( |
00:00.283 |
0.0 MB |
Libdevice function ( |
00:00.249 |
0.0 MB |
Fused Attention ( |
00:00.083 |
0.0 MB |