Philippe Tillet
|
9de87da993
|
GEMM: swapped loops in rC[m][n]*=alpha
|
2015-07-16 10:40:38 -04:00 |
|
Philippe Tillet
|
4b004e1cd5
|
GEMM: Added pointers declaration to the beginning of the kernel
|
2015-07-14 20:48:52 -07:00 |
|
Philippe Tillet
|
6a74eb3340
|
GEMM: moved declaration of shared memory to the beginning of the kernel
|
2015-07-14 20:41:34 -07:00 |
|
Philippe Tillet
|
8be02a50c3
|
GEMM: Cleaned generated GEMM code a little bit
|
2015-07-14 20:40:29 -07:00 |
|
Philippe Tillet
|
1257dda310
|
GEMM: Fixed typo
|
2015-07-10 23:16:21 -07:00 |
|
Philippe Tillet
|
2f106a9186
|
GEMM: Improved performance for cases other than NT
|
2015-07-10 21:15:36 -07:00 |
|
Philippe Tillet
|
cfa6ea812d
|
Cleaning: Largely renamed templates to BLAS-like names
|
2015-07-11 11:21:15 -04:00 |
|