Philippe Tillet
cf2dba43ef
Backend: A lot of bugfixes in dot() for handling shapes better
2015-06-30 17:55:57 -04:00
Philippe Tillet
e7cabf65ac
Tuning: Merged tune branch.
- Much cleaner and more concise source
- Better exception handling
- Checks local minima to see if retuning is needed.
Resolved conflicts:
bench/blas.cpp
include/isaac/backend/templates/mproduct.h
include/isaac/driver/buffer.h
lib/array.cpp
lib/backend/templates/mproduct.cpp
lib/driver/buffer.cpp
python/setup.py
tune/pysrc/autotune.py
tune/pysrc/dataset.py
tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe Tillet
48073dc710
C++: improved temporaries handling
2015-06-28 00:06:49 -07:00
Philippe
8f19d2a69c
C++/clBLAS: Bugfix in GEMM
2015-06-27 13:54:26 -04:00
Philippe
4cce9d3efd
C: More clBLAS tests
2015-06-27 11:44:50 -04:00
Philippe Tillet
0bead2d164
Python: Fixed missing space in setup.py
2015-06-24 16:50:50 -07:00
Philippe Tillet
80bcbd095f
C++: Some renaming; added possibility to pass buffers when constructing arrays
2015-06-23 09:38:34 -07:00
Philippe Tillet
0c9bf8da4e
Python: Added find_library in cmake/python/setup.py
2015-05-16 16:16:02 -04:00
Philippe Tillet
05e730f06e
CUDA: Many fixes in the backend
2015-05-13 02:26:38 -04:00
Philippe Tillet
e1506097b2
Python: now removing the build directory while packaging
2015-05-04 21:26:27 -04:00
Philippe Tillet
4a44361e06
Python: fixed setup.py
2015-05-04 02:15:29 -04:00
Philippe Tillet
3b983cf32f
CMake: some cleaning
2015-05-03 17:41:23 -04:00
Philippe Tillet
6b61bd3169
Python: added symlinks for src/include and src/lib
2015-05-03 03:44:53 -04:00
Philippe Tillet
7dba13c206
Structure: Python module is now standalone (setup.py generated by cmake). Better directory hierarchy
2015-05-03 03:40:14 -04:00
Philippe Tillet
29bfff69e9
Python: now statically linking (i.e. recompiling everything) for better portability.
2015-05-03 02:14:02 -04:00
Philippe Tillet
1a3923c562
Python: Wrapped exception
2015-05-01 11:29:39 -04:00
Philippe Tillet
86ffc7f6a6
Python: Better file-architecture in wrapper
2015-04-30 20:37:12 -04:00
Philippe Tillet
a65e8c32e0
Revert: Removed build files / old python wrapper
2015-04-30 02:42:09 -04:00
Philippe Tillet
e29fda1b1c
test
2015-04-30 02:25:14 -04:00
Philippe Tillet
006d0f13de
Packaging: polished
2015-04-30 00:46:42 -04:00
Philippe Tillet
5ef01f041a
Python: Refactored wrapper
2015-04-29 17:48:57 -04:00
Philippe Tillet
cf5028d55b
Squashed feature branch:
* Added CUDA support
* Performance improvements
* API improvements
* Added "depth" parameter to GEMM
* Android cross-compilation
2015-04-29 15:52:21 -04:00
Philippe Tillet
e453031094
More efficient access pattern in the GEMV kernel
2015-02-11 02:06:16 -05:00
Philippe Tillet
7e65601534
fixup
2015-02-08 23:22:48 -05:00
Philippe Tillet
a6d7671831
removing C++11 interface
2015-02-08 23:19:38 -05:00
Philippe Tillet
9c68704f09
Now using a list of events instead of a single one
2015-02-08 00:56:24 -05:00
Philippe Tillet
b768e913c9
Now using events to time autotuning
2015-02-06 22:11:03 -05:00
Philippe Tillet
e214927b16
Better control flow through options
2015-02-05 04:43:50 -05:00
Philippe Tillet
bbf2f0188e
Ported to C++11
2015-02-05 04:43:40 -05:00
Philippe Tillet
535706f35a
Some renaming; lower overhead in benchmark
2015-02-01 22:28:49 -05:00
Philippe Tillet
f0bb130416
Auto-tuner: Renamed "json_file" to "out"
2015-02-01 21:30:45 -05:00
Philippe Tillet
3f1fa822f8
save
2015-02-01 15:58:05 -05:00
Philippe Tillet
d29f1252ad
Clearer array_expression with hopefully lower overhead.
Also removed pyc's
2015-01-31 22:01:48 -05:00
Philippe Tillet
c7665021d1
reducing overhead; reverted custom CL/ header because CL/cl.hpp was buggy
2015-01-28 23:04:19 -05:00
Philippe Tillet
04cec21752
Fixed warnings and compilation for pyatidlas
2015-01-28 19:50:47 -05:00
Philippe Tillet
0dcf4d3617
Better exception handling; lowered CMake requirement; blas-bench now benchmarks square matrices
2015-01-28 17:08:39 -05:00
Philippe Tillet
c37d8a2a81
Now using system CL include
2015-01-27 16:19:50 -05:00
Philippe Tillet
2a249d26c6
Added postinstall script
2015-01-27 13:06:25 -05:00
Philippe Tillet
909e4b16a0
added preinstall script
2015-01-26 00:56:02 -05:00
Philippe Tillet
6d7f103ab1
Added GEMM benchmark
2015-01-25 18:19:19 -05:00
Philippe Tillet
da9defac04
No more random order in default initialization
Added linking to MKL/CBLAS
2015-01-24 14:51:48 -05:00
Philippe Tillet
9a76be3edc
Bugfix in autotuner
2015-01-21 20:08:52 -05:00
Philippe Tillet
0068560bc6
Some cleaning + outer product
2015-01-17 10:49:36 -05:00
Philippe Tillet
f9e5221ec8
removed useless files
2015-01-16 07:38:26 -05:00
Philippe Tillet
69311b7982
Now ATIDLAS is standalone. Everything dynamic....
2015-01-12 13:24:06 -05:00
Karl Rupp
7c4f8407eb
PyATIDLAS: Fixed typo (Bart -> Barts)
2014-11-06 23:12:26 +01:00
Philippe Tillet
6595d20c34
Fixed overhead bug in the auto-tuner (not in the benchmarks)
2014-11-06 16:15:22 -05:00
Karl Rupp
71c72970f6
PyATIDLAS: Added architecture for Maxwell (sm_50)
2014-11-06 13:57:54 +01:00
Philippe Tillet
2be82fc272
Added support for Devastator and Scrapper
2014-11-02 11:28:05 -05:00
Philippe Tillet
f892d4d85d
Added pyviennacl as a submodule
2014-11-02 11:05:19 -05:00