Philippe Tillet
4c123c4b38
Backend: GEMM - Improved bounds checking
2015-07-02 16:44:02 -04:00
Philippe Tillet
e7cabf65ac
Tuning: Merged tune branch.
- Much cleaner and more concise source
- Better exceptions handling
- Checks local minima to see if retuning is needed.
Resolved conflicts:
bench/blas.cpp
include/isaac/backend/templates/mproduct.h
include/isaac/driver/buffer.h
lib/array.cpp
lib/backend/templates/mproduct.cpp
lib/driver/buffer.cpp
python/setup.py
tune/pysrc/autotune.py
tune/pysrc/dataset.py
tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe Tillet
9f7e34ba5d
C++: Added clBLAS sGEMM ABI (still buggy)
2015-06-24 07:51:27 -07:00
Philippe Tillet
cf5028d55b
Squashed feature branch:
* Added CUDA support
* Performance improvements
* API improvements
* Added "depth" parameter to GEMM
* Android cross-compilation
2015-04-29 15:52:21 -04:00
Philippe Tillet
a6d7671831
removing C++11 interface
2015-02-08 23:19:38 -05:00
Philippe
385f007c0b
Fixed overhead-benchmark
2015-02-06 02:00:02 -05:00
Philippe
7fc2348924
Fixed CUDA benchmark
2015-02-05 23:42:31 -05:00
Philippe Tillet
8f8b01938b
Rearranged benchmarking script
2015-02-05 23:11:16 -05:00
Philippe Tillet
3f1fa822f8
save
2015-02-01 15:58:05 -05:00
Philippe Tillet
13ec84fbda
Bugfix in benchmark's cmakelists
2015-01-29 22:40:41 +01:00
Philippe Tillet
04cec21752
Fixed warnings and compilation for pyatidlas
2015-01-28 19:50:47 -05:00
Philippe Tillet
46836753aa
Bugfix in building cuda executable
2015-01-27 21:42:52 +01:00
Philippe Tillet
c12ec4cebd
tentative cuda benchmark integration
x
2015-01-27 15:32:59 -05:00
Philippe Tillet
be006268d7
More robust build system
2015-01-27 15:12:08 -05:00
Philippe Tillet
6d7f103ab1
Added GEMM benchmark
2015-01-25 18:19:19 -05:00
Philippe Tillet
da9defac04
No more random order in default initialization
Added linking to MKL/CBLAS
2015-01-24 14:51:48 -05:00
Philippe Tillet
ad05dea33f
Added benchmark for expression tree creation
2015-01-18 17:12:09 -05:00
Philippe Tillet
69311b7982
Now ATIDLAS is standalone. Everything dynamic....
2015-01-12 13:24:06 -05:00
Philippe Tillet
6049f0dfed
Bench: Removed OpenCL with CUBLAS
2014-11-06 07:21:27 -05:00
Philippe Tillet
146f54b32c
Improved benchmark suite
2014-11-06 07:07:27 -05:00
Philippe Tillet
bdeb18429b
Added CuBlas benchmark
2014-10-30 20:42:23 -04:00
Philippe Tillet
740ba08c5c
Added basic BLAS1 benchmark
2014-10-27 05:35:04 -04:00