Philippe Tillet
5c720a5b54
Backend: Fixed AXPY for shape=(1,x>1)
2015-07-01 11:48:01 -04:00
Philippe Tillet
e7cabf65ac
Tuning: Merged tune branch.
- Much cleaner and more concise source
- Better exception handling
- Checks local minima to see if retuning is needed.
Resolved conflicts:
bench/blas.cpp
include/isaac/backend/templates/mproduct.h
include/isaac/driver/buffer.h
lib/array.cpp
lib/backend/templates/mproduct.cpp
lib/driver/buffer.cpp
python/setup.py
tune/pysrc/autotune.py
tune/pysrc/dataset.py
tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe Tillet
80bcbd095f
C++: Some renaming; added possibility to pass buffers when constructing arrays
2015-06-23 09:38:34 -07:00
Philippe Tillet
05e730f06e
CUDA: Many fixes in the backend
2015-05-13 02:26:38 -04:00
Philippe Tillet
cf5028d55b
Squashed feature branch:
* Added CUDA support
* Performance improvements
* API improvements
* Added "depth" parameter to GEMM
* Android cross-compilation
2015-04-29 15:52:21 -04:00
Philippe Tillet
e214927b16
Better control flow through options
2015-02-05 04:43:50 -05:00
Philippe Tillet
bbf2f0188e
Ported to C++11
2015-02-05 04:43:40 -05:00
Philippe Tillet
2afc574724
Implemented simple operation cache
2015-02-01 23:56:05 -05:00
Philippe Tillet
535706f35a
Some renaming; lower overhead in benchmark
2015-02-01 22:28:49 -05:00
Philippe Tillet
d29f1252ad
Clearer array_expression with hopefully lower overhead.
Also removed .pyc files
2015-01-31 22:01:48 -05:00
Philippe Tillet
d4629ba018
Bugfix in cast and relational operators
2015-01-29 02:50:51 -05:00
Philippe Tillet
c7665021d1
Reducing overhead; reverted to custom CL/ header because CL/cl.hpp was buggy
2015-01-28 23:04:19 -05:00
Philippe Tillet
c37d8a2a81
Now using system CL include
2015-01-27 16:19:50 -05:00
Philippe Tillet
4a9e16fefd
Various bugfixes
2015-01-25 01:08:18 -05:00
Philippe Tillet
4f73fb384f
More flexibility in scalars
2015-01-19 21:29:47 -05:00
Philippe Tillet
16648f18e0
Various changes
2015-01-17 15:47:52 -05:00
Philippe Tillet
0068560bc6
Some cleaning + outer product
2015-01-17 10:49:36 -05:00
Philippe Tillet
1d70396711
Adding diag
2015-01-16 20:06:08 -05:00
Philippe Tillet
69311b7982
Now ATIDLAS is standalone. Everything dynamic....
2015-01-12 13:24:06 -05:00