Philippe Tillet
|
fa4cb6866d
|
Bench: Now displaying results in a table
|
2016-09-29 14:50:42 -04:00 |
|
Philippe Tillet
|
ca88c74749
|
Driver: removed useless macro
|
2016-05-31 23:28:32 -04:00 |
|
Philippe Tillet
|
fdc6ff7907
|
Driver: some cleaning
* automatic generation of comparison operators
* better comments
|
2016-05-31 16:41:41 -04:00 |
|
Philippe Tillet
|
81139e0642
|
Codestyle: rename helper_math.cu to vector.cu
|
2016-04-10 15:42:37 -04:00 |
|
Philippe Tillet
|
7f77fba4d4
|
General: Internal code generator overhaul
|
2016-04-02 18:19:33 -04:00 |
|
Philippe Tillet
|
a59337dc9e
|
Driver: minor bugfix in events handling for OpenCL
|
2015-12-22 17:54:20 -05:00 |
|
Philippe Tillet
|
f2c3aa0acb
|
Driver: no longer instantiating events when unnecessary; general cleaning of the driver module
|
2015-12-21 18:43:05 -05:00 |
|
Philippe Tillet
|
6623116372
|
Licensing: added blank line after license text
|
2015-12-21 17:04:09 -05:00 |
|
Philippe Tillet
|
da43f89ea4
|
Profiles: reorganized database
|
2015-12-21 02:43:04 -05:00 |
|
Philippe Tillet
|
ebbb6dd18e
|
LICENSING: added license headers; polished files hierarchy
|
2015-12-19 21:43:05 -05:00 |
|
Philippe Tillet
|
04f8b91425
|
Presets: better Maxwell profile
|
2015-12-18 03:38:01 -05:00 |
|
Philippe Tillet
|
b3c5251f91
|
CMake: Fixed clBLAS handling
|
2015-12-12 01:29:08 -05:00 |
|
Philippe Tillet
|
386963a6cc
|
Core: added queue-wise temporary workspace. WARNING: breaks the fused computation of multiple DOT/GEMV operations
|
2015-11-27 18:43:46 -05:00 |
|
Philippe Tillet
|
dfbe52c20a
|
Driver: now ignoring CUDA_ERROR_DEINITIALIZED in the destructors of CUDA C++ objects.
This should be harmless: ISAAC deinitializes CUDA at the very end, but external libraries may deinitialize it beforehand.
|
2015-11-27 02:09:15 -05:00 |
|
Philippe Tillet
|
c0b9bbee43
|
cuBLAS: fixed CUDA context import
|
2015-11-26 21:09:34 -05:00 |
|
Philippe Tillet
|
6fc94c0c0b
|
Kernels: Fixed various corner cases for the kernel templates and BLAS
|
2015-11-26 19:49:44 -05:00 |
|
Philippe Tillet
|
6be5929b0d
|
Core: fixed handle wrapping for CUcontext
|
2015-11-21 13:57:05 -05:00 |
|
Philippe Tillet
|
c6333c993a
|
API: adding cuBLAS interface
|
2015-11-20 12:46:42 -05:00 |
|
Philippe Tillet
|
e2cdb88338
|
Core: included bugfixes from the SVD branch
|
2015-11-19 12:37:18 -05:00 |
|
Philippe Tillet
|
426ba27d8b
|
Python: now ships vector.cu's string-header
|
2015-08-27 20:28:30 -04:00 |
|
Philippe Tillet
|
b6333c3a6e
|
Tuner: Now pretty-printing progress bar on command line
|
2015-08-27 20:25:02 -04:00 |
|
Philippe Tillet
|
69c11d16cc
|
Code quality: bugfix in bench/test to not call clBLAS on CUDA backend
|
2015-08-26 14:12:50 -04:00 |
|
Philippe Tillet
|
9da87bee51
|
Driver: fixed up invalid option for nvrtc
|
2015-08-26 13:44:40 -04:00 |
|
Philippe Tillet
|
0d3fcb18dc
|
Driver: now using proper compute capability option in nvrtc; added missing file.
|
2015-08-26 13:31:58 -04:00 |
|
Philippe Tillet
|
cf2d88a0a2
|
Binding: now releasing profiles in clblasTeardown()
|
2015-08-25 19:35:05 -04:00 |
|
Philippe Tillet
|
7b77d5ae4b
|
Driver: bugfixes in CUDA dynamic loading
|
2015-08-25 19:12:02 -04:00 |
|
Philippe Tillet
|
67a35a62bd
|
Driver: now loading the backend dynamically on Linux
|
2015-08-25 17:06:51 -04:00 |
|
Philippe Tillet
|
95f2564c1a
|
Tuning: Android UI improvement
|
2015-08-24 23:03:37 -04:00 |
|
Philippe Tillet
|
10524ebdee
|
CUDA: various improvements
|
2015-08-24 17:03:31 -04:00 |
|
Philippe Tillet
|
33dac6b05a
|
Code quality: fixed compilation errors with CUDA
|
2015-08-20 21:24:41 -04:00 |
|
Philippe Tillet
|
a57ff683e1
|
Code quality: removed obsolete comments
|
2015-08-18 15:34:55 -07:00 |
|
Philippe Tillet
|
feda5a8c9c
|
C++: added access to CL's platform id via driver::Platform
|
2015-08-18 14:21:12 -07:00 |
|
Philippe Tillet
|
da20db711b
|
Tune: Further file hierarchy improvements
|
2015-08-18 02:31:44 -07:00 |
|
U-AMR\ptillet
|
b34c611802
|
Code quality: Added consistency between int_t and size_t. Fixed warnings for Win64
|
2015-08-13 16:00:49 -07:00 |
|
Philippe Tillet
|
f7cb4ac960
|
Code quality: fixed implicit conversions from size_t to int_t
|
2015-08-13 14:30:11 -07:00 |
|
Philippe Tillet
|
29e752c765
|
Presets: Now checking device type when grabbing a preset
|
2015-08-13 13:43:26 -07:00 |
|
Philippe Tillet
|
ff4cf94df7
|
Code quality: significant cleaning of namespaces, etc.
|
2015-08-12 00:47:58 -07:00 |
|
Philippe Tillet
|
b5cc1f7ddc
|
Kernels: Now SizeType is always "int". Right now I don't expect data structures to have more than 2**31 entries. Improves performance on a number of routines.
|
2015-08-11 11:50:49 -07:00 |
|
Philippe Tillet
|
552f64c9eb
|
Driver: Fixed bug in NDRange
|
2015-08-07 20:56:11 -07:00 |
|
Philippe Tillet
|
1399404f04
|
Code quality: Fixed issue with to_string for proper compilation on Cygwin GCC
|
2015-08-06 20:20:08 -07:00 |
|
Philippe Tillet
|
33fea11547
|
Code quality: more cleaning of files architecture
|
2015-08-06 19:34:26 -07:00 |
|
Philippe Tillet
|
db090d7942
|
Code quality: Large clean-up of the codebase and especially of the include/ folder
|
2015-08-06 12:05:12 -07:00 |
|
Philippe Tillet
|
df9f6142ef
|
Models: added basic database structure
|
2015-08-05 17:01:42 -07:00 |
|
Philippe Tillet
|
08e4a5b9a6
|
Code quality: More MSVC warning fixing
|
2015-08-05 11:26:09 -07:00 |
|
Philippe Tillet
|
35b2550665
|
Code quality: safer getenv on windows
|
2015-08-05 11:16:14 -07:00 |
|
Philippe Tillet
|
dd61041df4
|
Driver: moved ProgramCache to a separate file
|
2015-08-04 11:11:38 -07:00 |
|
Philippe Tillet
|
9ef5331a1b
|
Models: more robust programs caching
|
2015-08-04 10:53:39 -07:00 |
|
Philippe Tillet
|
d88ff6b39b
|
Driver: dirty fix of programs caching with auto-tuner
|
2015-08-04 00:53:52 -07:00 |
|
Philippe Tillet
|
5b58769fbc
|
Code quality: some more cleaning
|
2015-08-03 22:50:49 -07:00 |
|
Philippe Tillet
|
1565363722
|
Driver: reverted lookup map key (for queues/program) to objects rather than pointers
|
2015-08-03 17:37:19 -07:00 |
|