Commit Graph

18 Commits

Author SHA1 Message Date
Philippe Tillet
46dad59e10 Tests: Fixed typos and polished test names 2015-12-12 13:31:14 -05:00
Philippe Tillet
004eebc038 Kernels: fixed kernels fusion for DOT, GEMV 2015-12-05 19:14:09 -05:00
Philippe Tillet
386963a6cc Core: added queue-wise temporary workspace. WARNING: breaks the fused computation of multiple DOT/GEMV operations 2015-11-27 18:43:46 -05:00
Philippe Tillet
6fc94c0c0b Kernels: Fixed various corner cases for the kernel templates and BLAS 2015-11-26 19:49:44 -05:00
Philippe Tillet
e2cdb88338 Core: included bugfixes from the SVD branch 2015-11-19 12:37:18 -05:00
Philippe Tillet
feeb1e9862 Feature: Merged kernel-fusion branch
* Fuses multiple AXPY kernel
* Possibility to add thread-wise for loops in AXPY-like kernels
2015-09-30 15:31:41 -04:00
Philippe Tillet
836a955663 GEMV: bugfix with CUDA 2015-08-30 02:35:55 -04:00
Philippe Tillet
67a35a62bd Driver: now loading the backend dynamically on Linux 2015-08-25 17:06:51 -04:00
Philippe Tillet
0bb73602f9 Kernel: Merged gemv-simd code 2015-08-13 10:15:32 -07:00
Philippe Tillet
b5cc1f7ddc Kernels: Now SizeType is always "int". Right now I don't expect data-structure to have more than 2**31 entries. Improves performance on a number of routines. 2015-08-11 11:50:49 -07:00
Philippe Tillet
963867574f Kernels: merged start1, start2 and stride1, stride2 into start and stride for matrices 2015-08-10 22:45:48 -07:00
Philippe Tillet
1399404f04 Code quality: Fixed issue with to_string for proper compilation on Cygwin GCC 2015-08-06 20:20:08 -07:00
Philippe Tillet
33fea11547 Code quality: more cleaning of files architecture 2015-08-06 19:34:26 -07:00
Philippe Tillet
e4ff883688 Code quality: Cleaned a bit file hierarchy in kernel templates 2015-08-06 16:14:33 -07:00
Philippe Tillet
db090d7942 Code quality: Large clean-up of the codebase and especially of the include/ folder 2015-08-06 12:05:12 -07:00
Philippe Tillet
bb4d2d62e3 Code quality: disabled the use of strcat / sprintf for safety issues on windows... 2015-08-05 11:42:08 -07:00
Philippe Tillet
f4c597b294 Code quality: fixed compilation errors/warnings with Clang 2015-08-05 09:26:50 -07:00
Philippe Tillet
1a42494411 Code quality: renamed "backend/" folder to "kernels". More explicit and no longer conflicts with "driver/" 2015-08-04 20:56:05 -07:00