Philippe Tillet
|
373771c796
|
Tuner: more polishing of intermediate BLAS3 sizes
|
2015-12-18 03:38:01 -05:00 |
|
Philippe Tillet
|
b89dc9a9ea
|
Code Quality: More renaming
|
2015-12-16 18:19:33 -05:00 |
|
Philippe Tillet
|
83feed534c
|
Code quality: more renaming
|
2015-12-16 16:34:36 -05:00 |
|
Philippe Tillet
|
042aa070bb
|
Code Quality: More sensible names
|
2015-12-12 21:19:59 -05:00 |
|
Philippe Tillet
|
46dad59e10
|
Tests: Fixed typos and polished test names
|
2015-12-12 13:31:14 -05:00 |
|
Philippe Tillet
|
b3c5251f91
|
CMake: Fixed clBLAS handling
|
2015-12-12 01:29:08 -05:00 |
|
Philippe Tillet
|
386963a6cc
|
Core: added queue-wise temporary workspace. WARNING: breaks the fused computation of multiple DOT/GEMV operations
|
2015-11-27 18:43:46 -05:00 |
|
Philippe Tillet
|
c6333c993a
|
API: adding cuBLAS interface
|
2015-11-20 12:46:42 -05:00 |
|
Philippe Tillet
|
bc20cc1ed7
|
Python: updated wrapper to match C++ API
|
2015-11-19 19:22:11 -05:00 |
|
Philippe Tillet
|
a843477438
|
CMake: removed legacy debug code
|
2015-11-19 18:33:38 -05:00 |
|
Philippe Tillet
|
e2cdb88338
|
Core: included bugfixes from the SVD branch
|
2015-11-19 12:37:18 -05:00 |
|
Philippe Tillet
|
ce07e490f6
|
Examples: polished tutorial
|
2015-10-08 20:43:04 -04:00 |
|
Philippe Tillet
|
1e076c131b
|
API: clearer interface for transposition
|
2015-10-01 21:58:59 -04:00 |
|
Philippe Tillet
|
feeb1e9862
|
Feature: Merged kernel-fusion branch
* Fuses multiple AXPY kernel
* Possibility to add thread-wise for loops in AXPY-like kernels
|
2015-09-30 15:31:41 -04:00 |
|
Philippe Tillet
|
1e77703f7f
|
Android: various fixes
|
2015-08-28 13:48:54 -04:00 |
|
Philippe Tillet
|
de159ca829
|
Python: fixed minor error in kernels.cpp
|
2015-08-27 20:27:14 -04:00 |
|
Philippe Tillet
|
69c11d16cc
|
Code quality: bugfix in bench/test to note call clBLAS on CUDA backend
|
2015-08-26 14:12:50 -04:00 |
|
Philippe Tillet
|
5d8a092ed8
|
Code quality: removed dead code related to obsolete static backend selection
|
2015-08-25 23:51:54 -04:00 |
|
Philippe Tillet
|
7b77d5ae4b
|
Driver: bugfixes in CUDA dynamic loading
|
2015-08-25 19:12:02 -04:00 |
|
Philippe Tillet
|
67a35a62bd
|
Driver: now loading the backend dynamically on Linux
|
2015-08-25 17:06:51 -04:00 |
|
Philippe Tillet
|
95f2564c1a
|
Tuning: Android UI improvement
|
2015-08-24 23:03:37 -04:00 |
|
Philippe Tillet
|
10524ebdee
|
CUDA: various improvements
|
2015-08-24 17:03:31 -04:00 |
|
Philippe Tillet
|
33dac6b05a
|
Code quality: fixed compilation errors with CUDA
|
2015-08-20 21:24:41 -04:00 |
|
Philippe Tillet
|
e5e1e0b55b
|
Python: Fixed typo
|
2015-08-18 14:34:13 -07:00 |
|
Philippe Tillet
|
9b0eb09f2e
|
Python: Installation bugfixes
|
2015-08-18 14:18:33 -07:00 |
|
Philippe Tillet
|
da20db711b
|
Tune: Further file hierarchy improvements
|
2015-08-18 02:31:44 -07:00 |
|
Philippe Tillet
|
0142936ff8
|
Tuner: Moved tuning logic into the python wrapper - draft of Android tuning app using kivy
|
2015-08-16 19:58:54 -07:00 |
|
Philippe Tillet
|
e912beaac3
|
Python: Updated boost to 1.58 - Kivy support in python wrapper
|
2015-08-14 22:15:05 -07:00 |
|
Philippe Tillet
|
8c7259bb3d
|
Python: fixed setup.py for external sklearn.tree usage
|
2015-08-14 10:06:37 -07:00 |
|
Philippe Tillet
|
ae9c87e224
|
Android: restored compatibility
|
2015-08-14 01:11:46 -07:00 |
|
Philippe Tillet
|
97f47847e4
|
Python: Updated driver.cpp to fit the previous changes
|
2015-08-13 18:27:43 -07:00 |
|
Philippe Tillet
|
bc66f7a1bf
|
Python wrapper: added missing files
|
2015-08-13 18:17:27 -07:00 |
|
U-AMR\ptillet
|
9fda50863a
|
Code quality: made array constructors explicit ; added missing include for <accumulate>
|
2015-08-13 14:01:34 -07:00 |
|
Philippe Tillet
|
29e752c765
|
Presets: Now checking device type when grabbing a preset
|
2015-08-13 13:43:26 -07:00 |
|
Philippe Tillet
|
0bb73602f9
|
Kernel: Merged gemv-simd code
|
2015-08-13 10:15:32 -07:00 |
|
Philippe Tillet
|
cd86d0234c
|
Tuner: Removed Sklearn + scipy dependency
|
2015-08-13 01:51:02 -07:00 |
|
Philippe Tillet
|
d49a541fd4
|
Python: Added external random forest support
|
2015-08-13 01:17:35 -07:00 |
|
Philippe Tillet
|
cc2d3416be
|
Python: added standalone decision tree from sklearn
|
2015-08-12 21:59:59 -07:00 |
|
Philippe Tillet
|
71224a1507
|
Code quality: renamed isaac shortcut from isc to sc
|
2015-08-12 19:38:53 -07:00 |
|
Philippe Tillet
|
cd51f75eb5
|
Python: added some new examples
|
2015-08-12 00:48:16 -07:00 |
|
Philippe Tillet
|
ff4cf94df7
|
Code quality: significant cleaning of namespaces, etc..
|
2015-08-12 00:47:58 -07:00 |
|
Philippe Tillet
|
5a8cfede45
|
Code quality: renamed model/ to database/
|
2015-08-11 20:18:39 -07:00 |
|
U-AMR\ptillet
|
0a506fa433
|
Code quality: fixed build issues with MSVC
|
2015-08-10 17:04:57 -07:00 |
|
Philippe Tillet
|
0f23cec87d
|
Python: added simple tutorial for benchmarking different GEMM profiles
|
2015-08-10 11:01:58 -07:00 |
|
Philippe Tillet
|
d6980d0cec
|
Python binding: fixed another typo...
|
2015-08-10 09:51:02 -07:00 |
|
Philippe Tillet
|
43b10ad509
|
Python wrapper: fixed typo...
|
2015-08-10 07:43:05 -07:00 |
|
U-AMR\ptillet
|
a33492cd29
|
Python Wrapper: Fix for Cygwin on Windows
|
2015-08-09 16:56:13 -07:00 |
|
Philippe Tillet
|
58d7e44457
|
Python Wrapper : Kernels - migrated local size parameters to base_impl<>
|
2015-08-09 16:41:37 -07:00 |
|
Philippe Tillet
|
a986bc3a17
|
Python: Reverted to Boost 1.55
|
2015-08-07 01:00:52 -07:00 |
|
Philippe Tillet
|
623c5f5e85
|
Python: Fixed compilation issues on Cygwin GCC
|
2015-08-07 00:22:54 -07:00 |
|