Commit Graph

26 Commits

Author | SHA1 | Message | Date
Philippe Tillet | 22f8e45ce3 | GEMM: Removed offx, offy | 2015-07-23 11:20:50 -07:00
Philippe Tillet | d4d4b2d55e | GEMM: Fixup cast in min() | 2015-07-23 10:00:50 -07:00
Philippe Tillet | 3e7791ad07 | GEMM: Further register cleaning | 2015-07-23 08:24:40 -07:00
Philippe Tillet | 1cec0a9183 | GEMM: Safer bounds checking for K | 2015-07-22 17:47:02 -07:00
Philippe Tillet | 79f833ba65 | GEMM: Better update of Kx, Ky | 2015-07-21 14:35:30 -04:00
Philippe Tillet | 33bd3a77fc | GEMM: Fixing bounds checking on K | 2015-07-21 14:35:22 -04:00
Philippe Tillet | 18663d6a93 | Code Quality: Cleaned warnings | 2015-07-20 23:07:53 -07:00
Philippe Tillet | 5d301c2c7e | GEMM: fixed nasty bounds checking bug! | 2015-07-20 22:58:45 -07:00
Philippe Tillet | cd155cb9e3 | Code quality: Improved compliance to MSVC | 2015-07-21 17:18:50 -04:00
Philippe Tillet | b922994a55 | GEMM: Reverted offx, offy removal | 2015-07-21 14:44:10 -04:00
Philippe Tillet | 7fdb8c0457 | GEMM: Removed offx, offy | 2015-07-18 10:24:44 -07:00
Philippe Tillet | f4615446c5 | GEMM: More optimizations | 2015-07-18 17:23:53 -04:00
Philippe Tillet | 6ccf32904a | GEMM: Still optimizing | 2015-07-18 16:06:17 -04:00
Philippe Tillet | ab82a9c048 | Cleaning GEMM test | 2015-07-18 13:09:38 -04:00
Philippe Tillet | 54ad83f4a6 | GEMM: Removed harmful check | 2015-07-17 14:02:34 -04:00
Philippe Tillet | cca19ec093 | GEMM: More stuff | 2015-07-16 00:30:36 -07:00
Philippe Tillet | 88610e4c01 | GEMM: New fetching policy | 2015-07-15 23:28:43 -07:00
Philippe Tillet | 3d4c9cf1e2 | Tuning: Added AlexNet sizes for GEMM | 2015-07-16 14:26:21 -04:00
Philippe Tillet | 1e3c853b58 | GEMM: Incorporated K bounds checking inside kernel | 2015-07-16 14:25:32 -04:00
Philippe Tillet | 9de87da993 | GEMM: swapped loops in rC[m][n]*=alpha | 2015-07-16 10:40:38 -04:00
Philippe Tillet | 4b004e1cd5 | GEMM: Added pointers declaration to the beginning of the kernel | 2015-07-14 20:48:52 -07:00
Philippe Tillet | 6a74eb3340 | GEMM: moved declaration of shared memory to the beginning of the kernel | 2015-07-14 20:41:34 -07:00
Philippe Tillet | 8be02a50c3 | GEMM: Cleaned generated GEMM code a little bit | 2015-07-14 20:40:29 -07:00
Philippe Tillet | 1257dda310 | GEMM: Fixed typo | 2015-07-10 23:16:21 -07:00
Philippe Tillet | 2f106a9186 | GEMM: Improved performance for cases other than NT | 2015-07-10 21:15:36 -07:00
Philippe Tillet | cfa6ea812d | Cleaning: Largely renamed templates to BLAS-like names | 2015-07-11 11:21:15 -04:00