Commit Graph

77 Commits

Author SHA1 Message Date
Philippe Tillet
fa4cb6866d Bench: Now displaying results in a table 2016-09-29 14:50:42 -04:00
Philippe Tillet
ca88c74749 Driver: removed useless macro 2016-05-31 23:28:32 -04:00
Philippe Tillet
fdc6ff7907 Driver: some cleaning
* automatic generation of comparison operators
* better comments
2016-05-31 16:41:41 -04:00
Philippe Tillet
81139e0642 Codestyle: rename helper_math.cu to vector.cu 2016-04-10 15:42:37 -04:00
Philippe Tillet
7f77fba4d4 General: Internal code generator overhaul 2016-04-02 18:19:33 -04:00
Philippe Tillet
a59337dc9e Driver: minor bugfix in events handling for OpenCL 2015-12-22 17:54:20 -05:00
Philippe Tillet
f2c3aa0acb Driver: no longer instantiating events when unnecessary ; general cleaning of the driver module 2015-12-21 18:43:05 -05:00
Philippe Tillet
6623116372 Licensing: added blank line after license text 2015-12-21 17:04:09 -05:00
Philippe Tillet
da43f89ea4 Profiles: reorganized database 2015-12-21 02:43:04 -05:00
Philippe Tillet
ebbb6dd18e LICENSING: added license headers ; polished files hierarchy 2015-12-19 21:43:05 -05:00
Philippe Tillet
04f8b91425 Presets: better Maxwell profile 2015-12-18 03:38:01 -05:00
Philippe Tillet
b3c5251f91 CMake: Fixed clBLAS handling 2015-12-12 01:29:08 -05:00
Philippe Tillet
386963a6cc Core: added queue-wise temporary workspace. WARNING: breaks the fused computation of multiple DOT/GEMV operations 2015-11-27 18:43:46 -05:00
Philippe Tillet
dfbe52c20a Driver: now ignoring CUDA_ERROR_DEINITIALIZED in the destructors of CUDA C++ objects.
This should be harmless. ISAAC deinitializes CUDA at the very end, but external libraries may deinitialize it beforehand.
2015-11-27 02:09:15 -05:00
Philippe Tillet
c0b9bbee43 cuBLAS: fixed CUDA context import 2015-11-26 21:09:34 -05:00
Philippe Tillet
6fc94c0c0b Kernels: Fixed various corner cases for the kernel templates and BLAS 2015-11-26 19:49:44 -05:00
Philippe Tillet
6be5929b0d Core: fixed handle wrapping for CUcontext 2015-11-21 13:57:05 -05:00
Philippe Tillet
c6333c993a API: adding cuBLAS interface 2015-11-20 12:46:42 -05:00
Philippe Tillet
e2cdb88338 Core: included bugfixes from the SVD branch 2015-11-19 12:37:18 -05:00
Philippe Tillet
426ba27d8b Python: now ships vector.cu's string-header 2015-08-27 20:28:30 -04:00
Philippe Tillet
b6333c3a6e Tuner: Now pretty-printing progress bar on command line 2015-08-27 20:25:02 -04:00
Philippe Tillet
69c11d16cc Code quality: bugfix in bench/test to not call clBLAS on CUDA backend 2015-08-26 14:12:50 -04:00
Philippe Tillet
9da87bee51 Driver: fixed up invalid option for nvrtc 2015-08-26 13:44:40 -04:00
Philippe Tillet
0d3fcb18dc Driver: now using proper compute capability option in nvrtc ; added missing file. 2015-08-26 13:31:58 -04:00
Philippe Tillet
cf2d88a0a2 Binding: now releasing profiles in clblasTeardown() 2015-08-25 19:35:05 -04:00
Philippe Tillet
7b77d5ae4b Driver: bugfixes in CUDA dynamic loading 2015-08-25 19:12:02 -04:00
Philippe Tillet
67a35a62bd Driver: now loading the backend dynamically on Linux 2015-08-25 17:06:51 -04:00
Philippe Tillet
95f2564c1a Tuning: Android UI improvement 2015-08-24 23:03:37 -04:00
Philippe Tillet
10524ebdee CUDA: various improvements 2015-08-24 17:03:31 -04:00
Philippe Tillet
33dac6b05a Code quality: fixed compilation errors with CUDA 2015-08-20 21:24:41 -04:00
Philippe Tillet
a57ff683e1 Code quality: removed obsolete comments 2015-08-18 15:34:55 -07:00
Philippe Tillet
feda5a8c9c C++: added access to CL's platform id via driver::Platform 2015-08-18 14:21:12 -07:00
Philippe Tillet
da20db711b Tune: Further file hierarchy improvements 2015-08-18 02:31:44 -07:00
U-AMR\ptillet
b34c611802 Code quality: Added consistency between int_t and size_t. Fixed warnings for Win64 2015-08-13 16:00:49 -07:00
Philippe Tillet
f7cb4ac960 Code quality: fixed implicit conversions from size_t to int_t 2015-08-13 14:30:11 -07:00
Philippe Tillet
29e752c765 Presets: Now checking device type when grabbing a preset 2015-08-13 13:43:26 -07:00
Philippe Tillet
ff4cf94df7 Code quality: significant cleaning of namespaces, etc. 2015-08-12 00:47:58 -07:00
Philippe Tillet
b5cc1f7ddc Kernels: Now SizeType is always "int". Right now I don't expect data structures to have more than 2**31 entries. Improves performance on a number of routines. 2015-08-11 11:50:49 -07:00
Philippe Tillet
552f64c9eb Driver: Fixed bug in NDRange 2015-08-07 20:56:11 -07:00
Philippe Tillet
1399404f04 Code quality: Fixed issue with to_string for proper compilation on Cygwin GCC 2015-08-06 20:20:08 -07:00
Philippe Tillet
33fea11547 Code quality: more cleaning of files architecture 2015-08-06 19:34:26 -07:00
Philippe Tillet
db090d7942 Code quality: Large clean-up of the codebase and especially of the include/ folder 2015-08-06 12:05:12 -07:00
Philippe Tillet
df9f6142ef Models: added basic database structure 2015-08-05 17:01:42 -07:00
Philippe Tillet
08e4a5b9a6 Code quality: More MSVC warning fixing 2015-08-05 11:26:09 -07:00
Philippe Tillet
35b2550665 Code quality: safer getenv on windows 2015-08-05 11:16:14 -07:00
Philippe Tillet
dd61041df4 Driver: moved ProgramCache to a separate file 2015-08-04 11:11:38 -07:00
Philippe Tillet
9ef5331a1b Models: more robust programs caching 2015-08-04 10:53:39 -07:00
Philippe Tillet
d88ff6b39b Driver: dirty fix of programs caching with auto-tuner 2015-08-04 00:53:52 -07:00
Philippe Tillet
5b58769fbc Code quality: some more cleaning 2015-08-03 22:50:49 -07:00
Philippe Tillet
1565363722 Driver: reverted lookup map key (for queues/program) to objects rather than pointers 2015-08-03 17:37:19 -07:00