Commit Graph

81 Commits

Author SHA1 Message Date
U-AMR\ptillet
8879a867d8 Code Quality: fixed compilation errors on MSVC 2015-07-20 18:05:31 -07:00
Philippe Tillet
cd155cb9e3 Code quality: Improved compliance to MSVC 2015-07-21 17:18:50 -04:00
Philippe Tillet
cbe930398e Code quality: Cleaned directory folder, variable names and mingw compliance 2015-07-21 13:29:23 -04:00
Philippe Tillet
1e3c853b58 GEMM: Incorporated K bounds checking inside kernel 2015-07-16 14:25:32 -04:00
Philippe Tillet
2f106a9186 GEMM: Improved performance for cases other than NT 2015-07-10 21:15:36 -07:00
Philippe Tillet
84e47b871b Tests: Now using prime-numbered sizes for GEMM 2015-07-11 12:19:14 -04:00
Philippe Tillet
cfa6ea812d Cleaning: Largely renamed templates to BLAS-like names 2015-07-11 11:21:15 -04:00
Philippe Tillet
e25dcf97ea Bugfix in SIMD handling for other layouts 2015-07-10 16:05:28 -04:00
Philippe Tillet
347f4025f2 Cleaned up GEMM 2015-07-09 15:03:55 -04:00
Philippe Tillet
b18442c220 Fixup 2015-07-07 23:39:17 -07:00
Philippe Tillet
bdd4ea05fd Trying to further improve bounds checking 2015-07-08 22:37:57 -04:00
Philippe Tillet
4c123c4b38 Backend: GEMM - Improved bounds checking 2015-07-02 16:44:02 -04:00
Philippe Tillet
cf2dba43ef Backend: A lot of bugfixes in dot() for handling shapes better 2015-06-30 17:55:57 -04:00
Philippe Tillet
e7cabf65ac Tuning: Merged tune branch.
- Much cleaner and more concise source
- Better exceptions handling
- Checks local minima to see if retuning is needed.

Resolved conflicts:
	bench/blas.cpp
	include/isaac/backend/templates/mproduct.h
	include/isaac/driver/buffer.h
	lib/array.cpp
	lib/backend/templates/mproduct.cpp
	lib/driver/buffer.cpp
	python/setup.py
	tune/pysrc/autotune.py
	tune/pysrc/dataset.py
	tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe Tillet
48073dc710 C++: improved temporaries handling 2015-06-28 00:06:49 -07:00
Philippe
8f19d2a69c C++/clBLAS: Bugfix in GEMM 2015-06-27 13:54:26 -04:00
Philippe
4cce9d3efd C: More clBLAS tests 2015-06-27 11:44:50 -04:00
Philippe Tillet
0bead2d164 Python: Fixed missing space in setup.py 2015-06-24 16:50:50 -07:00
Philippe Tillet
80bcbd095f C++: Some renaming; added possibility to pass buffers when constructing arrays 2015-06-23 09:38:34 -07:00
Philippe Tillet
0c9bf8da4e Python: Added find_library in cmake/python/setup.py 2015-05-16 16:16:02 -04:00
Philippe Tillet
05e730f06e CUDA: Many fixes in the backend 2015-05-13 02:26:38 -04:00
Philippe Tillet
e1506097b2 Python: now removing the build directory while packaging 2015-05-04 21:26:27 -04:00
Philippe Tillet
4a44361e06 Python: fixed setup.py 2015-05-04 02:15:29 -04:00
Philippe Tillet
3b983cf32f CMake: some cleaning 2015-05-03 17:41:23 -04:00
Philippe Tillet
6b61bd3169 Python: added symlinks for src/include and src/lib 2015-05-03 03:44:53 -04:00
Philippe Tillet
7dba13c206 Structure: Python module is now standalone (setup.py generated by cmake). Better directory hierarchy 2015-05-03 03:40:14 -04:00
Philippe Tillet
29bfff69e9 Python: now statically linking (i.e. recompiling everything) for better portability. 2015-05-03 02:14:02 -04:00
Philippe Tillet
1a3923c562 Python: Wrapped exception 2015-05-01 11:29:39 -04:00
Philippe Tillet
86ffc7f6a6 Python: Better file-architecture in wrapper 2015-04-30 20:37:12 -04:00
Philippe Tillet
a65e8c32e0 Revert: Removed build files / old python wrapper 2015-04-30 02:42:09 -04:00
Philippe Tillet
e29fda1b1c test 2015-04-30 02:25:14 -04:00
Philippe Tillet
006d0f13de Packaging: polished 2015-04-30 00:46:42 -04:00
Philippe Tillet
5ef01f041a Python: Refactored wrapper 2015-04-29 17:48:57 -04:00
Philippe Tillet
cf5028d55b Squashed feature branch:
* Added CUDA support
 * Performance improvements
 * API improvements
 * Added "depth" parameter to GEMM
 * Android cross-compilation
2015-04-29 15:52:21 -04:00
Philippe Tillet
e453031094 More efficient access pattern in the GEMV kernel 2015-02-11 02:06:16 -05:00
Philippe Tillet
7e65601534 fixup 2015-02-08 23:22:48 -05:00
Philippe Tillet
a6d7671831 removing C++11 interface 2015-02-08 23:19:38 -05:00
Philippe Tillet
9c68704f09 Now using a list of event instead of a single one 2015-02-08 00:56:24 -05:00
Philippe Tillet
b768e913c9 Now using events to time autotuning 2015-02-06 22:11:03 -05:00
Philippe Tillet
e214927b16 Better control flow through options 2015-02-05 04:43:50 -05:00
Philippe Tillet
bbf2f0188e Ported to C++11 2015-02-05 04:43:40 -05:00
Philippe Tillet
535706f35a Some renaming; lower overhead in benchmark 2015-02-01 22:28:49 -05:00
Philippe Tillet
f0bb130416 Auto-tuner: Renamed "json_file" to "out" 2015-02-01 21:30:45 -05:00
Philippe Tillet
3f1fa822f8 save 2015-02-01 15:58:05 -05:00
Philippe Tillet
d29f1252ad Clearer array_expression with hopefully lower overhead.
Also removed pyc's
2015-01-31 22:01:48 -05:00
Philippe Tillet
c7665021d1 reducing overhead; reverted custom CL/ header because CL/cl.hpp was buggy 2015-01-28 23:04:19 -05:00
Philippe Tillet
04cec21752 Fixed warnings and compilation for pyatidlas 2015-01-28 19:50:47 -05:00
Philippe Tillet
0dcf4d3617 Better exception handling, lowered CMake requirement ; blas-bench now benchmarks square matrices 2015-01-28 17:08:39 -05:00
Philippe Tillet
c37d8a2a81 Now using system CL include 2015-01-27 16:19:50 -05:00
Philippe Tillet
2a249d26c6 Added postinstall script 2015-01-27 13:06:25 -05:00