U-AMR\ptillet
e8dc88b6f1
Code Quality: fixed compilation errors on MSVC
2015-07-20 18:02:56 -07:00
Philippe Tillet
cd155cb9e3
Code quality: Improved compliance with MSVC
2015-07-21 17:18:50 -04:00
Philippe Tillet
cbe930398e
Code quality: Cleaned up directories and variable names, improved MinGW compliance
2015-07-21 13:29:23 -04:00
Philippe Tillet
1e3c853b58
GEMM: Incorporated K bounds checking inside kernel
2015-07-16 14:25:32 -04:00
Philippe Tillet
2f106a9186
GEMM: Improved performance for cases other than NT
2015-07-10 21:15:36 -07:00
Philippe Tillet
84e47b871b
Tests: Now using prime-numbered sizes for GEMM
2015-07-11 12:19:14 -04:00
Philippe Tillet
cfa6ea812d
Cleaning: Largely renamed templates to BLAS-like names
2015-07-11 11:21:15 -04:00
Philippe Tillet
e25dcf97ea
Bugfix in SIMD handling for other layouts
2015-07-10 16:05:28 -04:00
Philippe Tillet
347f4025f2
Cleaned up GEMM
2015-07-09 15:03:55 -04:00
Philippe Tillet
b18442c220
Fixup
2015-07-07 23:39:17 -07:00
Philippe Tillet
bdd4ea05fd
Trying to further improve bounds checking
2015-07-08 22:37:57 -04:00
Philippe Tillet
4c123c4b38
Backend: GEMM - Improved bounds checking
2015-07-02 16:44:02 -04:00
Philippe Tillet
cf2dba43ef
Backend: A lot of bugfixes in dot() for handling shapes better
2015-06-30 17:55:57 -04:00
Philippe Tillet
e7cabf65ac
Tuning: Merged tune branch.
...
- Much cleaner and more concise source
- Better exception handling
- Checks local minima to see if retuning is needed.
Resolved conflicts:
bench/blas.cpp
include/isaac/backend/templates/mproduct.h
include/isaac/driver/buffer.h
lib/array.cpp
lib/backend/templates/mproduct.cpp
lib/driver/buffer.cpp
python/setup.py
tune/pysrc/autotune.py
tune/pysrc/dataset.py
tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe Tillet
48073dc710
C++: improved temporaries handling
2015-06-28 00:06:49 -07:00
Philippe
8f19d2a69c
C++/clBLAS: Bugfix in GEMM
2015-06-27 13:54:26 -04:00
Philippe
4cce9d3efd
C: More clBLAS tests
2015-06-27 11:44:50 -04:00
Philippe Tillet
0bead2d164
Python: Fixed missing space in setup.py
2015-06-24 16:50:50 -07:00
Philippe Tillet
80bcbd095f
C++: Some renaming; added possibility to pass buffers when constructing arrays
2015-06-23 09:38:34 -07:00
Philippe Tillet
0c9bf8da4e
Python: Added find_library in cmake/python/setup.py
2015-05-16 16:16:02 -04:00
Philippe Tillet
05e730f06e
CUDA: Many fixes in the backend
2015-05-13 02:26:38 -04:00
Philippe Tillet
e1506097b2
Python: now removing the build directory while packaging
2015-05-04 21:26:27 -04:00
Philippe Tillet
4a44361e06
Python: fixed setup.py
2015-05-04 02:15:29 -04:00
Philippe Tillet
3b983cf32f
CMake: some cleaning
2015-05-03 17:41:23 -04:00
Philippe Tillet
7dba13c206
Structure: Python module is now standalone (setup.py generated by cmake). Better directory hierarchy
2015-05-03 03:40:14 -04:00
Philippe Tillet
29bfff69e9
Python: now statically linking (i.e. recompiling everything) for better portability.
2015-05-03 02:14:02 -04:00
Philippe Tillet
1a3923c562
Python: Wrapped exception
2015-05-01 11:29:39 -04:00
Philippe Tillet
a65e8c32e0
Revert: Removed build files / old python wrapper
2015-04-30 02:42:09 -04:00
Philippe Tillet
006d0f13de
Packaging: polished
2015-04-30 00:46:42 -04:00
Philippe Tillet
16c872f38d
More packaging ...
2014-10-16 06:57:38 -04:00
Philippe Tillet
d4eb3ed820
Now correctly linking to pyviennacl
2014-10-15 14:21:00 -04:00
Philippe Tillet
30089b7fa6
ATIDLAS's python wrapper is now stand-alone
2014-10-15 05:24:19 -04:00
Philippe Tillet
f91d3b422a
Now compiling ATIDLAS
2014-10-14 23:50:35 -04:00