Philippe Tillet
|
feeb1e9862
|
Feature: Merged kernel-fusion branch
* Fuses multiple AXPY kernels
* Adds the possibility of thread-wise for-loops in AXPY-like kernels
|
2015-09-30 15:31:41 -04:00 |
|
Philippe Tillet
|
1399404f04
|
Code quality: Fixed issue with to_string for proper compilation on Cygwin GCC
|
2015-08-06 20:20:08 -07:00 |
|
Philippe Tillet
|
33fea11547
|
Code quality: further cleanup of the file architecture
|
2015-08-06 19:34:26 -07:00 |
|
Philippe Tillet
|
db090d7942
|
Code quality: Large clean-up of the codebase and especially of the include/ folder
|
2015-08-06 12:05:12 -07:00 |
|
Philippe Tillet
|
cf5028d55b
|
Squashed feature branch:
* Added CUDA support
* Performance improvements
* API improvements
* Added "depth" parameter to GEMM
* Android cross-compilation
|
2015-04-29 15:52:21 -04:00 |
|
Philippe Tillet
|
d29f1252ad
|
Clearer array_expression with hopefully lower overhead.
Also removed .pyc files
|
2015-01-31 22:01:48 -05:00 |
|
Philippe Tillet
|
e74563070a
|
API enhancement
|
2015-01-20 11:17:42 -05:00 |
|
Philippe Tillet
|
0068560bc6
|
Some cleaning + outer product
|
2015-01-17 10:49:36 -05:00 |
|
Philippe Tillet
|
faa3974f3c
|
Fixed some warnings
|
2015-01-16 07:38:26 -05:00 |
|
Philippe Tillet
|
69311b7982
|
Now ATIDLAS is standalone. Everything dynamic....
|
2015-01-12 13:24:06 -05:00 |
|