Computation timesΒΆ
18:11.256 total execution time for getting-started_tutorials files:
Matrix Multiplication ( |
07:11.851 |
0.0 MB |
Layer Normalization ( |
05:37.085 |
0.0 MB |
Fused Softmax ( |
03:32.100 |
0.0 MB |
Vector Addition ( |
01:49.605 |
0.0 MB |
Low-Memory Dropout ( |
00:00.290 |
0.0 MB |
Libdevice function ( |
00:00.253 |
0.0 MB |
Fused Attention ( |
00:00.072 |
0.0 MB |