Philippe Tillet
6bc5d9e1cb
Python: fixed compilation issues
2016-04-10 15:41:55 -04:00
Philippe Tillet
97a0d65a4d
Code quality: reorganized files structure
2016-04-10 13:13:16 -04:00
Philippe Tillet
7f77fba4d4
General: Internal code generator overhaul
2016-04-02 18:19:33 -04:00
Philippe Tillet
6623116372
Licensing: added blank line after license text
2015-12-21 17:04:09 -05:00
Philippe Tillet
0d09b0518f
API: more consistent zeros() initializer
2015-12-21 03:33:13 -05:00
Philippe Tillet
ebbb6dd18e
LICENSING: added license headers ; polished files hierarchy
2015-12-19 21:43:05 -05:00
Philippe Tillet
d9eb51d04a
Code quality: renamed math_expression -> expression_tree
2015-12-19 03:29:51 -05:00
Philippe Tillet
e2cdb88338
Core: included bugfixes from the SVD branch
2015-11-19 12:37:18 -05:00
Philippe Tillet
714e0f5634
API: Fixed single-element indexing
2015-10-07 01:13:55 -04:00
Philippe Tillet
2648724217
API: diag() now usable as lvalue
2015-10-07 00:50:49 -04:00
Philippe Tillet
07b8ba20de
API: some fixes with 1D slices
2015-10-06 16:34:47 -04:00
Philippe Tillet
8daf13da2e
Code quality: some renaming here and there
2015-10-05 14:35:46 -04:00
Philippe Tillet
3e4f147fbc
Code quality: removed ambiguous overload
2015-10-04 17:31:39 -04:00
Philippe Tillet
d97250bce5
API: removed explicit constructors for math expressions
2015-10-04 17:08:44 -04:00
Philippe Tillet
07e7bd862c
API: added diag(matrix)
2015-10-04 17:05:06 -04:00
Philippe Tillet
740f5def49
API: polished slice construction
2015-10-03 19:30:50 -04:00
Philippe Tillet
b5100f9d9a
API: Added shallow-copiable view object for viewing slices of arrays.
2015-10-03 18:51:02 -04:00
Philippe Tillet
1e076c131b
API: clearer interface for transposition
2015-10-01 21:58:59 -04:00
Philippe Tillet
feeb1e9862
Feature: Merged kernel-fusion branch
...
* Fuses multiple AXPY kernel
* Possibility to add thread-wise for loops in AXPY-like kernels
2015-09-30 15:31:41 -04:00
Philippe Tillet
f7cb4ac960
Code quality: fixed implicit conversions from size_t to int_t
2015-08-13 14:30:11 -07:00
U-AMR\ptillet
9fda50863a
Code quality: made array constructors explicit ; added missing include for <accumulate>
2015-08-13 14:01:34 -07:00
Philippe Tillet
1565363722
Driver: reverted lookup map key (for queues/program) to objects rather than pointers
2015-08-03 17:37:19 -07:00
Philippe Tillet
92d8afc888
Code quality: cleaned backend terminology
2015-08-03 17:11:24 -07:00
Philippe Tillet
81b9f01336
Driver: Contexts are now unique and non-copyable
2015-07-31 00:41:03 -07:00
Philippe Tillet
29bdf7f546
Code quality: made the backend static
2015-07-30 21:15:37 -07:00
Philippe Tillet
d22d86e5c7
Code quality: Added ISAACAPI macro for exporting symbols into shared library ; exported symbols in array.h
2015-07-21 21:35:56 -07:00
U-AMR\ptillet
8879a867d8
Code Quality: fixed compilation errors on MSVC
2015-07-20 18:05:31 -07:00
Philippe Tillet
cfa6ea812d
Cleaning: Largely renamed templates to BLAS-like names
2015-07-11 11:21:15 -04:00
Philippe Tillet
e7cabf65ac
Tuning: Merged tune branch.
...
- Much cleaner and more concise source
- Better exceptions handling
- Checks local minima to see if retuning is needed.
Resolved conflicts:
bench/blas.cpp
include/isaac/backend/templates/mproduct.h
include/isaac/driver/buffer.h
lib/array.cpp
lib/backend/templates/mproduct.cpp
lib/driver/buffer.cpp
python/setup.py
tune/pysrc/autotune.py
tune/pysrc/dataset.py
tune/pysrc/misc_tools.py
2015-06-28 17:53:16 -07:00
Philippe
4cce9d3efd
C: More clBLAS tests
2015-06-27 11:44:50 -04:00
Philippe Tillet
b32de3ac76
C++: More clBLAS routines
2015-06-25 08:12:16 -07:00
Philippe Tillet
9f7e34ba5d
C++: Added clBLAS sGEMM ABI (still buggy)
2015-06-24 07:51:27 -07:00
Philippe Tillet
80bcbd095f
C++: Some renaming; added possibility to pass buffers when constructing arrays
2015-06-23 09:38:34 -07:00
Philippe Tillet
f55e499ef5
C++: added support for [unsigned] long long
2015-05-04 23:54:43 -04:00
Philippe Tillet
278109eef8
C++: Now using standard C++ types instead of stdint
2015-05-04 21:23:05 -04:00
Philippe Tillet
cf5028d55b
Squashed feature branch:
...
* Added CUDA support
* Performance improvements
* API improvements
* Added "depth" parameter to GEMM
* Android cross-compilation
2015-04-29 15:52:21 -04:00