| Author | Commit | Message | Date |
|---|---|---|---|
| Philippe Tillet | 33bd3a77fc | GEMM: Fixing bounds checking on K | 2015-07-21 14:35:22 -04:00 |
| Philippe Tillet | 18663d6a93 | Code Quality: Cleaned warnings | 2015-07-20 23:07:53 -07:00 |
| Philippe Tillet | 5d301c2c7e | GEMM: fixed nasty bounds checking bug! | 2015-07-20 22:58:45 -07:00 |
| Philippe Tillet | cd155cb9e3 | Code quality: Improved compliance to MSVC | 2015-07-21 17:18:50 -04:00 |
| Philippe Tillet | b922994a55 | GEMM: Reverted offx, offy removal | 2015-07-21 14:44:10 -04:00 |
| Philippe Tillet | 7fdb8c0457 | GEMM: Removed offx, offy | 2015-07-18 10:24:44 -07:00 |
| Philippe Tillet | f4615446c5 | GEMM: More optimizations | 2015-07-18 17:23:53 -04:00 |
| Philippe Tillet | 6ccf32904a | GEMM: Still optimizing | 2015-07-18 16:06:17 -04:00 |
| Philippe Tillet | ab82a9c048 | Cleaning GEMM test | 2015-07-18 13:09:38 -04:00 |
| Philippe Tillet | 54ad83f4a6 | GEMM: Removed harmful check | 2015-07-17 14:02:34 -04:00 |
| Philippe Tillet | cca19ec093 | GEMM: More stuff | 2015-07-16 00:30:36 -07:00 |
| Philippe Tillet | 88610e4c01 | GEMM: New fetching policy | 2015-07-15 23:28:43 -07:00 |
| Philippe Tillet | 3d4c9cf1e2 | Tuning: Added AlexNet sizes for GEMM | 2015-07-16 14:26:21 -04:00 |
| Philippe Tillet | 1e3c853b58 | GEMM: Incorporated K bounds checking inside kernel | 2015-07-16 14:25:32 -04:00 |
| Philippe Tillet | 9de87da993 | GEMM: swapped loops in rC[m][n]*=alpha | 2015-07-16 10:40:38 -04:00 |
| Philippe Tillet | 4b004e1cd5 | GEMM: Added pointers declaration to the beginning of the kernel | 2015-07-14 20:48:52 -07:00 |
| Philippe Tillet | 6a74eb3340 | GEMM: moved declaration of shared memory to the beginning of the kernel | 2015-07-14 20:41:34 -07:00 |
| Philippe Tillet | 8be02a50c3 | GEMM: Cleaned generated GEMM code a little bit | 2015-07-14 20:40:29 -07:00 |
| Philippe Tillet | 1257dda310 | GEMM: Fixed typo | 2015-07-10 23:16:21 -07:00 |
| Philippe Tillet | 2f106a9186 | GEMM: Improved performance for cases other than NT | 2015-07-10 21:15:36 -07:00 |
| Philippe Tillet | cfa6ea812d | Cleaning: Largely renamed templates to BLAS-like names | 2015-07-11 11:21:15 -04:00 |