Philippe Tillet
cbe930398e
Code quality: Cleaned directory folder, variable names and mingw compliance
2015-07-21 13:29:23 -04:00
Philippe Tillet
7fdb8c0457
GEMM: Removed offx, offy
2015-07-18 10:24:44 -07:00
Philippe Tillet
f4615446c5
GEMM: More optimizations
2015-07-18 17:23:53 -04:00
Philippe Tillet
6ccf32904a
GEMM: Still optimizing
2015-07-18 16:06:17 -04:00
Philippe Tillet
ab82a9c048
Cleaning GEMM test
2015-07-18 13:09:38 -04:00
Philippe Tillet
54ad83f4a6
GEMM: Removed harmful check
2015-07-17 14:02:34 -04:00
Philippe Tillet
cca19ec093
GEMM: More stuff
2015-07-16 00:30:36 -07:00
Philippe Tillet
88610e4c01
GEMM: New fetching policy
2015-07-15 23:28:43 -07:00
Philippe Tillet
3d4c9cf1e2
Tuning: Added AlexNet sizes for GEMM
2015-07-16 14:26:21 -04:00
Philippe Tillet
1e3c853b58
GEMM: Incorporated K bounds checking inside kernel
2015-07-16 14:25:32 -04:00
Philippe Tillet
9de87da993
GEMM: swapped loops in rC[m][n]*=alpha
2015-07-16 10:40:38 -04:00
Philippe Tillet
4b004e1cd5
GEMM: Added pointers declaration to the beginning of the kernel
2015-07-14 20:48:52 -07:00
Philippe Tillet
6a74eb3340
GEMM: moved declaration of shared memory to the beginning of the kernel
2015-07-14 20:41:34 -07:00
Philippe Tillet
8be02a50c3
GEMM: Cleaned generated GEMM code a little bit
2015-07-14 20:40:29 -07:00
Philippe Tillet
753a9b1f3e
Benchmarks: now benchmaring all AlexNet sizes
2015-07-14 13:33:23 -04:00
Philippe Tillet
e2f45f55f3
Tuning: Bugfixes and loosened local minima check
2015-07-12 23:19:00 -07:00
Philippe Tillet
1257dda310
GEMM: Fixed typo
2015-07-10 23:16:21 -07:00
Philippe Tillet
2f106a9186
GEMM: Improved performance for cases other than NT
2015-07-10 21:15:36 -07:00
Philippe Tillet
84e47b871b
Tests: Now using prime-numbered sizes for GEMM
2015-07-11 12:19:14 -04:00
Philippe Tillet
cfa6ea812d
Cleaning: Largely renamed templates to BLAS-like names
2015-07-11 11:21:15 -04:00
Philippe Tillet
281fa9c7a6
Benchmarks: Now testing AlexNet's size
2015-07-10 16:05:28 -04:00
Philippe Tillet
2b10363668
GEMM: More bugfixes
2015-07-10 16:05:28 -04:00
Philippe Tillet
e25dcf97ea
Bugfix in SIMD handling for other layouts
2015-07-10 16:05:28 -04:00
Philippe Tillet
47406a5e50
Implementing vector for other layouts
2015-07-09 20:07:44 -04:00
Philippe Tillet
347f4025f2
Cleaned up GEMM
2015-07-09 15:03:55 -04:00
Philippe Tillet
4ec061ceeb
More...
2015-07-09 13:32:32 -04:00
Philippe Tillet
931a403d81
More fix
2015-07-09 13:09:01 -04:00
Philippe Tillet
a676b15448
Fixup
2015-07-09 11:40:26 -04:00
Philippe Tillet
4e25e20206
More bounds checking
2015-07-09 10:52:54 -04:00
Philippe Tillet
b18442c220
Fixup
2015-07-07 23:39:17 -07:00
Philippe Tillet
bdd4ea05fd
Trying to further improve bounds checking
2015-07-08 22:37:57 -04:00
Philippe Tillet
4c123c4b38
Backend: GEMM - Improved bounds checking
2015-07-02 16:44:02 -04:00
Philippe Tillet
41204d6b74
Tests: Random initialization ; alpha is now negative in axpy
2015-07-01 11:48:24 -04:00
Philippe Tillet
5c720a5b54
Backend: Fixed AXPY for shape=(1,x>1)
2015-07-01 11:48:01 -04:00
Philippe Tillet
9d0d50ba05
Backend: Fixed alpha, beta in GEMM.
2015-06-29 21:52:50 -07:00
Philippe Tillet
cf2dba43ef
Backend: A lot of bugfixes in dot() for handling shapes better
2015-06-30 17:55:57 -04:00
Philippe Tillet
e7cabf65ac
Tuning: Merged tune branch.
...
- Much cleaner and more concise source
- Better exceptions handling
- Checks local minima to see if retuning is needed.
Resolved conflicts:
bench/blas.cpp
include/isaac/backend/templates/mproduct.h
include/isaac/driver/buffer.h
lib/array.cpp
lib/backend/templates/mproduct.cpp
lib/driver/buffer.cpp
python/setup.py
tune/pysrc/autotune.py
tune/pysrc/dataset.py
tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe Tillet
48073dc710
C++: improved temporaries handling
2015-06-28 00:06:49 -07:00
Philippe Tillet
0e207e7ca4
Backend: Now not creating a temporary upon C = alpha*dot(op(A), op(B)) + beta*C
2015-06-27 17:55:01 -07:00
Philippe Tillet
3525edd54c
BLAS: Added row-major support and tests
2015-06-27 15:22:26 -04:00
Philippe
8f19d2a69c
C++/clBLAS: Bugfix in GEMM
2015-06-27 13:54:26 -04:00
Philippe
743a559f76
Backend: Bugfix in GEMM bound-checking
2015-06-27 13:14:46 -04:00
Philippe
4cce9d3efd
C: More clBLAS tests
2015-06-27 11:44:50 -04:00
Philippe Tillet
e6cecc5a09
C: Some fixes in BLAS
2015-06-26 08:08:22 -07:00
Philippe Tillet
b0cd25ac4b
Added C BLAS1 test
2015-06-25 23:12:26 -07:00
Philippe Tillet
b32de3ac76
C++: More clBLAS routines
2015-06-25 08:12:16 -07:00
Philippe Tillet
a42112f8f3
C++: Fixed warning in execution_option's queue initialization
2015-06-24 16:51:55 -07:00
Philippe Tillet
0bead2d164
Python: Fixed missing space in setup.py
2015-06-24 16:50:50 -07:00
Philippe Tillet
9f7e34ba5d
C++: Added clBLAS sGEMM ABI (still buggy)
2015-06-24 07:51:27 -07:00
Philippe Tillet
c61eaceb21
Tune: Fixed correctness
2015-06-23 09:54:51 -07:00