Philippe Tillet
|
80bcbd095f
|
C++: Some renaming; added possibility to pass buffers when constructing arrays
|
2015-06-23 09:38:34 -07:00 |
|
Philippe Tillet
|
0c9bf8da4e
|
Python: Added find_library in cmake/python/setup.py
|
2015-05-16 16:16:02 -04:00 |
|
Philippe Tillet
|
05e730f06e
|
CUDA: Many fixes in the backend
|
2015-05-13 02:26:38 -04:00 |
|
Philippe Tillet
|
f55e499ef5
|
C++: added support for [unsigned] long long
|
2015-05-04 23:54:43 -04:00 |
|
Philippe Tillet
|
e1506097b2
|
Python: now removing the build directory while packaging
|
2015-05-04 21:26:27 -04:00 |
|
Philippe Tillet
|
278109eef8
|
C++: Now using standard C++ types instead of stdint
|
2015-05-04 21:23:05 -04:00 |
|
Philippe Tillet
|
5cdbef7b4e
|
C++: in value_scalar replaced cl types by stdint types
|
2015-05-04 19:05:32 -04:00 |
|
Philippe Tillet
|
4a44361e06
|
Python: fixed setup.py
|
2015-05-04 02:15:29 -04:00 |
|
Philippe Tillet
|
3b983cf32f
|
CMake: some cleaning
|
2015-05-03 17:41:23 -04:00 |
|
Philippe Tillet
|
217de21991
|
Python: now bundling python package
|
2015-05-03 04:02:19 -04:00 |
|
Philippe Tillet
|
6b61bd3169
|
Python: added symlinks for src/include and src/lib
|
2015-05-03 03:44:53 -04:00 |
|
Philippe Tillet
|
7dba13c206
|
Structure: Python module is now standalone (setup.py generated by cmake). Better directory hierarchy
|
2015-05-03 03:40:14 -04:00 |
|
Philippe Tillet
|
29bfff69e9
|
Python: now statically linking (i.e. recompiling everything) for better portability.
|
2015-05-03 02:14:02 -04:00 |
|
Philippe Tillet
|
25242bc19d
|
Android: added simple test application
|
2015-05-02 23:16:56 -04:00 |
|
Philippe Tillet
|
d1092abd5f
|
CMake: added bin2cpp
|
2015-05-01 21:40:11 -04:00 |
|
Philippe Tillet
|
9f186342a5
|
API: changed axis convention to fit numpy
|
2015-05-01 15:58:08 -04:00 |
|
Philippe Tillet
|
1f9d0f0f14
|
CMake: Deactivated pyinstaller
|
2015-05-01 11:35:41 -04:00 |
|
Philippe Tillet
|
1a3923c562
|
Python: Wrapped exception
|
2015-05-01 11:29:39 -04:00 |
|
Philippe Tillet
|
86ffc7f6a6
|
Python: Better file-architecture in wrapper
|
2015-04-30 20:37:12 -04:00 |
|
Philippe Tillet
|
fffdc1b79e
|
Revert: Deleted additional crappy file...
|
2015-04-30 02:47:15 -04:00 |
|
Philippe Tillet
|
a65e8c32e0
|
Revert: Removed build files / old python wrapper
|
2015-04-30 02:42:09 -04:00 |
|
Philippe Tillet
|
e29fda1b1c
|
test
|
2015-04-30 02:25:14 -04:00 |
|
Philippe Tillet
|
006d0f13de
|
Packaging: polished
|
2015-04-30 00:46:42 -04:00 |
|
Philippe Tillet
|
5ef01f041a
|
Python: Refactored wrapper
|
2015-04-29 17:48:57 -04:00 |
|
Philippe Tillet
|
a67476671d
|
Bench: Removed warnings in bench-blas when no external blas is defined
|
2015-04-29 16:11:32 -04:00 |
|
Philippe Tillet
|
cf5028d55b
|
Squashed feature branch:
* Added CUDA support
* Performance improvements
* API improvements
* Added "depth" parameter to GEMM
* Android cross-compilation
|
2015-04-29 15:52:21 -04:00 |
|
Philippe Tillet
|
5ff16bfcb6
|
Added cublas sgemm
|
2015-02-13 04:31:42 -05:00 |
|
Philippe Tillet
|
e453031094
|
More efficient access pattern in the GEMV kernel
|
2015-02-11 02:06:16 -05:00 |
|
Philippe Tillet
|
85b7eb8b5e
|
Added another parameter to GEMV
|
2015-02-10 16:33:38 -05:00 |
|
Philippe Tillet
|
37fc98c532
|
Fixed bug on marix-vector products with vectorization
|
2015-02-10 03:09:41 -05:00 |
|
Philippe Tillet
|
a89f6d88be
|
Fix bug in operation-specific tuning
|
2015-02-09 01:58:32 -05:00 |
|
Philippe Tillet
|
7e65601534
|
fixup
|
2015-02-08 23:22:48 -05:00 |
|
Philippe Tillet
|
a6d7671831
|
removing C++11 interface
|
2015-02-08 23:19:38 -05:00 |
|
Philippe Tillet
|
85fb438806
|
More convenient use of specific runtime tuning
|
2015-02-08 14:23:38 -05:00 |
|
Philippe Tillet
|
9c68704f09
|
Now using a list of event instead of a single one
|
2015-02-08 00:56:24 -05:00 |
|
Philippe Tillet
|
b768e913c9
|
Now using events to time autotuning
|
2015-02-06 22:11:03 -05:00 |
|
Philippe
|
385f007c0b
|
Fixed overhead-benchmark
|
2015-02-06 02:00:02 -05:00 |
|
Philippe
|
7fc2348924
|
Fixed CUDA benchmark
|
2015-02-05 23:42:31 -05:00 |
|
Philippe Tillet
|
58fdc5d18e
|
Added FindOpenBlas
|
2015-02-05 23:17:42 -05:00 |
|
Philippe Tillet
|
8f8b01938b
|
Rearranged benchmarking script
|
2015-02-05 23:11:16 -05:00 |
|
Philippe Tillet
|
e214927b16
|
Better control flow through options
|
2015-02-05 04:43:50 -05:00 |
|
Philippe Tillet
|
bbf2f0188e
|
Ported to C++11
|
2015-02-05 04:43:40 -05:00 |
|
Philippe Tillet
|
3a296ae3b7
|
Added a control flow API
|
2015-02-03 15:25:01 -05:00 |
|
Philippe Tillet
|
939ce15b45
|
Cleaner benchmarking code
|
2015-02-02 00:03:48 -05:00 |
|
Philippe Tillet
|
2afc574724
|
Implemented simple operation cache
|
2015-02-01 23:56:05 -05:00 |
|
Philippe Tillet
|
535706f35a
|
Some renaming; lower overhead in benchmark
|
2015-02-01 22:28:49 -05:00 |
|
Philippe Tillet
|
f0bb130416
|
Auto-tuner: Renamed "json_file" to "out"
|
2015-02-01 21:30:45 -05:00 |
|
Philippe Tillet
|
3b61842528
|
Lower overhead in the benchmarking source code
|
2015-02-01 18:59:27 -05:00 |
|
Philippe Tillet
|
b404b687ee
|
Incorporated low-level array representation to store array's parameters
|
2015-02-01 17:15:41 -05:00 |
|
Philippe Tillet
|
3f1fa822f8
|
save
|
2015-02-01 15:58:05 -05:00 |
|