Philippe Tillet
ebbb6dd18e
LICENSING: added license headers ; polished files hierarchy
2015-12-19 21:43:05 -05:00
Philippe Tillet
b3c5251f91
CMake: Fixed clBLAS handling
2015-12-12 01:29:08 -05:00
Philippe Tillet
386963a6cc
Core: added queue-wise temporary workspace. WARNING: breaks the fused computation of multiple DOT/GEMV operations
2015-11-27 18:43:46 -05:00
Philippe Tillet
c0b9bbee43
cuBLAS: fixed CUDA context import
2015-11-26 21:09:34 -05:00
Philippe Tillet
6fc94c0c0b
Kernels: Fixed various corner cases for the kernel templates and BLAS
2015-11-26 19:49:44 -05:00
Philippe Tillet
6be5929b0d
Core: fixed handle wrapping for CUcontext
2015-11-21 13:57:05 -05:00
Philippe Tillet
67a35a62bd
Driver: now loading the backend dynamically on Linux
2015-08-25 17:06:51 -04:00
Philippe Tillet
33dac6b05a
Code quality: fixed compilation errors with CUDA
2015-08-20 21:24:41 -04:00
Philippe Tillet
db090d7942
Code quality: Large clean-up of the codebase and especially of the include/ folder
2015-08-06 12:05:12 -07:00
Philippe Tillet
35b2550665
Code quality: safer getenv on windows
2015-08-05 11:16:14 -07:00
Philippe Tillet
afc4ecee98
Driver: Back to global programs caching
2015-07-31 00:43:17 -07:00
Philippe Tillet
81b9f01336
Driver: Contexts are now unique and non-copyable
2015-07-31 00:41:03 -07:00
Philippe Tillet
21a2566904
Driver: moved programs allocation logic to a static variable
2015-07-30 14:35:41 -07:00
Philippe Tillet
902805acea
Driver: perhaps better ownership control
2015-07-28 17:39:52 -07:00
Philippe Tillet
10745fc013
Driver: other bugfixes
2015-07-27 17:20:12 -07:00
Philippe Tillet
89ee015f7f
General: Bugfixes here and there
2015-07-27 11:37:19 -07:00
Philippe Tillet
4715723e61
Driver: Fixed issue in ownership handling for BLAS
2015-07-26 21:13:28 -07:00
Philippe Tillet
0ef6654c5f
Code quality: removed dependencies on the C++ OpenCL wrapper
2015-07-26 10:05:16 -07:00
Philippe Tillet
a2b533b9a8
Driver: made cl and cu attributes private in Handle<>
2015-07-23 09:40:18 -07:00
Philippe Tillet
e7cabf65ac
Tuning: Merged tune branch.
...
- Much cleaner and more concise source
- Better exceptions handling
- Checks local minima to see if retuning is needed.
Resolved conflicts:
bench/blas.cpp
include/isaac/backend/templates/mproduct.h
include/isaac/driver/buffer.h
lib/array.cpp
lib/backend/templates/mproduct.cpp
lib/driver/buffer.cpp
python/setup.py
tune/pysrc/autotune.py
tune/pysrc/dataset.py
tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe Tillet
80bcbd095f
C++: Some renaming; added possibility to pass buffers when constructing arrays
2015-06-23 09:38:34 -07:00
Philippe Tillet
cf5028d55b
Squashed feature branch:
...
* Added CUDA support
* Performance improvements
* API improvements
* Added "depth" parameter to GEMM
* Android cross-compilation
2015-04-29 15:52:21 -04:00