Philippe Tillet
5f78dedbcd
Tests: Added double precision check
2015-07-26 21:35:39 -07:00
Philippe Tillet
4715723e61
Driver: Fixed issue in ownership handling for BLAS
2015-07-26 21:13:28 -07:00
Philippe Tillet
674d708425
Backend: fixed ocl::info<> for CL_PROGRAM_BINARIES
2015-07-26 18:30:06 -07:00
Philippe Tillet
0ef6654c5f
Code quality: removed dependencies on the C++ OpenCL wrapper
2015-07-26 10:05:16 -07:00
Philippe Tillet
a2b533b9a8
Driver: made cl and cu attributes private in Handle<>
2015-07-23 09:40:18 -07:00
Philippe Tillet
155554f5cf
Code quality: added clBLAS.def and some ISAACAPI
2015-07-21 23:48:50 -07:00
Philippe Tillet
5eb9212535
Code quality: more ISAACEXPORT
2015-07-21 22:02:36 -07:00
Philippe Tillet
d22d86e5c7
Code quality: Added ISAACAPI macro for exporting symbols into shared library ; exported symbols in array.h
2015-07-21 21:35:56 -07:00
Philippe Tillet
18663d6a93
Code Quality: Cleaned warnings
2015-07-20 23:07:53 -07:00
U-AMR\ptillet
8879a867d8
Code Quality: fixed compilation errors on MSVC
2015-07-20 18:05:31 -07:00
Philippe Tillet
cd155cb9e3
Code quality: Improved compliance to MSVC
2015-07-21 17:18:50 -04:00
Philippe Tillet
cbe930398e
Code quality: Cleaned directory folder, variable names and mingw compliance
2015-07-21 13:29:23 -04:00
Philippe Tillet
cfa6ea812d
Cleaning: Largely renamed templates to BLAS-like names
2015-07-11 11:21:15 -04:00
Philippe Tillet
9d0d50ba05
Backend: Fixed alpha, beta in GEMM.
2015-06-29 21:52:50 -07:00
Philippe Tillet
e7cabf65ac
Tuning: Merged tune branch.
...
- Much cleaner and more concise source
- Better exceptions handling
- Checks local minima to see if retuning is needed.
Resolved conflicts:
bench/blas.cpp
include/isaac/backend/templates/mproduct.h
include/isaac/driver/buffer.h
lib/array.cpp
lib/backend/templates/mproduct.cpp
lib/driver/buffer.cpp
python/setup.py
tune/pysrc/autotune.py
tune/pysrc/dataset.py
tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe Tillet
0e207e7ca4
Backend: Now not creating a temporary upon C = alpha*dot(op(A), op(B)) + beta*C
2015-06-27 17:55:01 -07:00
Philippe
4cce9d3efd
C: More clBLAS tests
2015-06-27 11:44:50 -04:00
Philippe Tillet
e6cecc5a09
C: Some fixes in BLAS
2015-06-26 08:08:22 -07:00
Philippe Tillet
b0cd25ac4b
Added C BLAS1 test
2015-06-25 23:12:26 -07:00
Philippe Tillet
b32de3ac76
C++: More clBLAS routines
2015-06-25 08:12:16 -07:00
Philippe Tillet
a42112f8f3
C++: Fixed warning in execution_option's queue initialization
2015-06-24 16:51:55 -07:00
Philippe Tillet
9f7e34ba5d
C++: Added clBLAS sGEMM ABI (still buggy)
2015-06-24 07:51:27 -07:00
Philippe Tillet
80bcbd095f
C++: Some renaming; added possibility to pass buffers when constructing arrays
2015-06-23 09:38:34 -07:00
Philippe Tillet
05e730f06e
CUDA: Many fixes in the backend
2015-05-13 02:26:38 -04:00
Philippe Tillet
f55e499ef5
C++: added support for [unsigned] long long
2015-05-04 23:54:43 -04:00
Philippe Tillet
278109eef8
C++: Now using standard C++ types instead of stdint
2015-05-04 21:23:05 -04:00
Philippe Tillet
5cdbef7b4e
C++: in value_scalar replaced cl types by stdint types
2015-05-04 19:05:32 -04:00
Philippe Tillet
cf5028d55b
Squashed feature branch:
...
* Added CUDA support
* Performance improvements
* API improvements
* Added "depth" parameter to GEMM
* Android cross-compilation
2015-04-29 15:52:21 -04:00