Philippe Tillet
c6333c993a
API: adding cuBLAS interface
2015-11-20 12:46:42 -05:00
Philippe Tillet
da1b0a9571
GEMM: performance regression fix
2015-11-19 20:49:38 -05:00
Philippe Tillet
1e076c131b
API: clearer interface for transposition
2015-10-01 21:58:59 -04:00
Philippe Tillet
feeb1e9862
Feature: Merged kernel-fusion branch
...
* Fuses multiple AXPY kernel
* Possibility to add thread-wise for loops in AXPY-like kernels
2015-09-30 15:31:41 -04:00
Philippe Tillet
149441b9e2
Bench: improved output formatting
2015-08-31 13:35:29 -04:00
Philippe Tillet
836a955663
GEMV: bugfix with CUDA
2015-08-30 02:35:55 -04:00
Philippe Tillet
8dcf062342
Benchmarks: added consistency between CUDA and the rest
2015-08-27 22:55:20 -04:00
Philippe Tillet
6676b94d00
Bench: no longer reallocating memory for CUDA.
2015-08-27 19:09:22 -04:00
Philippe Tillet
f06a3bdf53
Bugfix: fixed bug in dynamic kernel selection
2015-08-26 19:11:09 -04:00
Philippe Tillet
ffb3c01b77
Code quality: fixed typo
2015-08-26 14:24:12 -04:00
Philippe Tillet
69c11d16cc
Code quality: bugfix in bench/test to note call clBLAS on CUDA backend
2015-08-26 14:12:50 -04:00
Philippe Tillet
10524ebdee
CUDA: various improvements
2015-08-24 17:03:31 -04:00
Philippe Tillet
33dac6b05a
Code quality: fixed compilation errors with CUDA
2015-08-20 21:24:41 -04:00
Philippe Tillet
efdbf5f4a6
Bench: Added LeNet sizes
2015-08-18 16:44:35 -07:00
Philippe Tillet
470ae7f4be
Benchmarks: More human-readable labels
2015-08-17 16:32:10 -07:00
Philippe Tillet
99e2592c34
Benchmarks: Fixed MKL setup
2015-08-13 17:20:41 -07:00
Philippe Tillet
71224a1507
Code quality: renamed isaac shortcut from isc to sc
2015-08-12 19:38:53 -07:00
Philippe Tillet
ff4cf94df7
Code quality: significant cleaning of namespaces, etc..
2015-08-12 00:47:58 -07:00
Philippe Tillet
f60b82af25
Kernels: more generic temporary workspace checks
2015-08-10 10:19:50 -07:00
Philippe Tillet
2dc351f932
Benchmarks: removed useless linebreaks after some benchmarks
2015-08-10 09:38:13 -07:00
Philippe Tillet
89b0873c24
Benchmarks: now benchmarking the GEMV sizes of AlexNet
2015-08-10 09:37:44 -07:00
Philippe Tillet
0f0946b7a7
CMake: Now converting JSon to source with EOG / Warnings fix on GCC
2015-08-06 17:54:04 -07:00
Philippe Tillet
db090d7942
Code quality: Large clean-up of the codebase and especially of the include/ folder
2015-08-06 12:05:12 -07:00
Philippe Tillet
ebab87af61
Code quality: everything now compiles without warnings on MSVC
2015-08-05 13:01:20 -07:00
Philippe Tillet
ef14e3aa17
Code quality: more MSVD warnings removal
2015-08-05 12:47:20 -07:00
Philippe Tillet
54c4a31af0
Code quality: more warning fixes
2015-08-05 12:07:51 -07:00
Philippe Tillet
f4c597b294
Code quality: fixed compilation errors/warnings with Clang
2015-08-05 09:26:50 -07:00
Philippe Tillet
5b58769fbc
Code quality: some more cleaning
2015-08-03 22:50:49 -07:00
Philippe Tillet
1565363722
Driver: reverted lookup map key (for queues/program) to objects rather than pointers
2015-08-03 17:37:19 -07:00
Philippe Tillet
92d8afc888
Code quality: cleaned backend terminology
2015-08-03 17:11:24 -07:00
Philippe Tillet
dd5f592fd8
Code quality: destructors of static global objects will no longer call external dependencies. Manual cleaning possible through backend::release()
...
This should solve errors when libOpenCL's ICD was unloaded prior to libisaac.
2015-07-31 15:47:41 -07:00
Philippe Tillet
dfe3d26f5e
Misc: Fixed compilation issues on tests/bench
2015-07-31 01:02:59 -07:00
Philippe Tillet
29bdf7f546
Code quality: made the backend static
2015-07-30 21:15:37 -07:00
Philippe Tillet
89ee015f7f
General: Bugfixes here and there
2015-07-27 11:37:19 -07:00
Philippe Tillet
a2b533b9a8
Driver: made cl and cu attributes private in Handle<>
2015-07-23 09:40:18 -07:00
Philippe Tillet
cbe930398e
Code quality: Cleaned directory folder, variable names and mingw compliance
2015-07-21 13:29:23 -04:00
Philippe Tillet
f4615446c5
GEMM: More optimizations
2015-07-18 17:23:53 -04:00
Philippe Tillet
6ccf32904a
GEMM: Still optimizing
2015-07-18 16:06:17 -04:00
Philippe Tillet
753a9b1f3e
Benchmarks: now benchmaring all AlexNet sizes
2015-07-14 13:33:23 -04:00
Philippe Tillet
281fa9c7a6
Benchmarks: Now testing AlexNet's size
2015-07-10 16:05:28 -04:00
Philippe Tillet
e7cabf65ac
Tuning: Merged tune branch.
...
- Much cleaner and more concise source
- Better exceptions handling
- Checks local minima to see if retuning is needed.
Resolved conflicts:
bench/blas.cpp
include/isaac/backend/templates/mproduct.h
include/isaac/driver/buffer.h
lib/array.cpp
lib/backend/templates/mproduct.cpp
lib/driver/buffer.cpp
python/setup.py
tune/pysrc/autotune.py
tune/pysrc/dataset.py
tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe Tillet
b32de3ac76
C++: More clBLAS routines
2015-06-25 08:12:16 -07:00
Philippe Tillet
9f7e34ba5d
C++: Added clBLAS sGEMM ABI (still buggy)
2015-06-24 07:51:27 -07:00
Philippe Tillet
a67476671d
Bench: Removed warnings in bench-blas when no external blas is defined
2015-04-29 16:11:32 -04:00
Philippe Tillet
cf5028d55b
Squashed feature branch:
...
* Added CUDA support
* Performance improvements
* API improvements
* Added "depth" parameter to GEMM
* Android cross-compilation
2015-04-29 15:52:21 -04:00
Philippe Tillet
5ff16bfcb6
Added cublas sgemm
2015-02-13 04:31:42 -05:00
Philippe Tillet
e453031094
More efficient access pattern in the GEMV kernel
2015-02-11 02:06:16 -05:00
Philippe Tillet
37fc98c532
Fixed bug on marix-vector products with vectorization
2015-02-10 03:09:41 -05:00
Philippe Tillet
a89f6d88be
Fix bug in operation-specific tuning
2015-02-09 01:58:32 -05:00
Philippe Tillet
a6d7671831
removing C++11 interface
2015-02-08 23:19:38 -05:00