Philippe Tillet
|
6595d20c34
|
Fixed overhead bug in the auto-tuner (not in the benchmarks)
|
2014-11-06 16:15:22 -05:00 |
|
Philippe Tillet
|
8dd492de23
|
Now using ViennaCL for reference instead of "naive"
|
2014-11-06 16:15:22 -05:00 |
|
Karl Rupp
|
71c72970f6
|
PyATIDLAS: Added architecture for Maxwell (sm_50)
|
2014-11-06 13:57:54 +01:00 |
|
Philippe Tillet
|
6049f0dfed
|
Bench: Removed OpenCL with CUBLAS
|
2014-11-06 07:21:27 -05:00 |
|
Philippe Tillet
|
0b18a48eff
|
Models: Bugfix...
|
2014-11-06 07:08:36 -05:00 |
|
Philippe Tillet
|
34998afee2
|
Templates/Reduction : Removed warnings
|
2014-11-06 07:08:09 -05:00 |
|
Philippe Tillet
|
146f54b32c
|
Improved benchmark suite
|
2014-11-06 07:07:27 -05:00 |
|
Philippe Tillet
|
2be82fc272
|
Added support for Devastator and Scrapper
|
2014-11-02 11:28:05 -05:00 |
|
Philippe Tillet
|
f892d4d85d
|
Added pyviennacl as a submodule
|
2014-11-02 11:05:19 -05:00 |
|
Philippe Tillet
|
6ca682e5eb
|
Added include-guards in viennacl headers generation ; bugfix in autotune.py's operation parsing
|
2014-11-02 10:31:27 -05:00 |
|
Philippe Tillet
|
0ae87eeff4
|
Added viennacl-src-path in the UI
|
2014-11-02 10:05:14 -05:00 |
|
Philippe Tillet
|
45d602bfc8
|
Better UI
|
2014-10-31 18:56:33 -04:00 |
|
Philippe Tillet
|
89f3e1d211
|
Better GA initialization
|
2014-10-31 18:12:55 -04:00 |
|
Philippe Tillet
|
bdeb18429b
|
Added CuBlas benchmark
|
2014-10-30 20:42:23 -04:00 |
|
Philippe Tillet
|
e576d12d66
|
Added ViennaCL time
|
2014-10-30 14:20:14 -04:00 |
|
Philippe Tillet
|
f0a4255daa
|
Now actually benchmarking axpy
|
2014-10-30 14:11:50 -04:00 |
|
Philippe Tillet
|
32fa1e2b7c
|
Now gnuplot script exports to file
|
2014-10-30 13:20:59 -04:00 |
|
Philippe Tillet
|
71b4bde3ec
|
Better benchmarking
|
2014-10-30 13:04:33 -04:00 |
|
Philippe Tillet
|
de48ccc7b1
|
GEMM now specified in terms of MNK rather than MKN
|
2014-10-29 23:52:27 -04:00 |
|
Philippe Tillet
|
a8531cac37
|
Fixed bug in AXPY ; added the possibility to update a .json file
|
2014-10-29 23:38:22 -04:00 |
|
Philippe Tillet
|
b46f26e54c
|
Auto-tuner: More flexibility
|
2014-10-29 12:45:20 -04:00 |
|
Philippe Tillet
|
086e51d291
|
More logical ENUM creation
|
2014-10-29 17:03:35 +01:00 |
|
Philippe Tillet
|
83bce648a0
|
Now using the median for benchmarking
|
2014-10-29 17:03:24 +01:00 |
|
Philippe Tillet
|
4712e3f235
|
Fixed import bug
|
2014-10-29 17:02:57 +01:00 |
|
Philippe Tillet
|
7707eada5e
|
Fixed GEMM bug
|
2014-10-29 17:02:47 +01:00 |
|
Philippe Tillet
|
9803bc8c92
|
Enhancements of the auto-tuner
|
2014-10-29 17:01:57 +01:00 |
|
Philippe Tillet
|
199badc689
|
Cleaned some weird interaction between pyviennacl and pyatidlas
|
2014-10-28 02:43:42 -04:00 |
|
Philippe Tillet
|
9a5627e20f
|
Prettier command line stuff
|
2014-10-28 01:10:14 -04:00 |
|
Philippe Tillet
|
1bc7d50258
|
Now everything is entirely handled through the command line
|
2014-10-27 22:22:07 -04:00 |
|
Philippe Tillet
|
740ba08c5c
|
Added basic BLAS1 benchmark
|
2014-10-27 05:35:04 -04:00 |
|
Philippe Tillet
|
ba50960b0f
|
Python/Autotune: Moved devices from config.ini to command line argument
|
2014-10-27 03:28:46 -04:00 |
|
Philippe Tillet
|
7780423fa1
|
C++: Slight modifications on tests
|
2014-10-27 03:28:03 -04:00 |
|
Philippe Tillet
|
ec24a29aa1
|
C++: Added possibility to override the model
|
2014-10-27 03:27:43 -04:00 |
|
Philippe Tillet
|
ee131be534
|
C++: Basic model handling
|
2014-10-27 01:42:11 -04:00 |
|
Philippe Tillet
|
11c283590f
|
Replaced cxfreeze with pyinstaller. Works better.
|
2014-10-16 17:49:17 -04:00 |
|
Philippe Tillet
|
e0f0400a55
|
Config: Added SONAME for pyatidlas
|
2014-10-16 09:59:37 -04:00 |
|
Philippe Tillet
|
16c872f38d
|
More packaging ...
|
2014-10-16 06:57:38 -04:00 |
|
Philippe Tillet
|
d4eb3ed820
|
Now correctly linking to pyviennacl
|
2014-10-15 14:21:00 -04:00 |
|
Philippe Tillet
|
30089b7fa6
|
ATIDLAS's python wrapper is now stand-alone
|
2014-10-15 05:24:19 -04:00 |
|
Philippe Tillet
|
f91d3b422a
|
Now compiling ATIDLAS
|
2014-10-14 23:50:35 -04:00 |
|
Philippe Tillet
|
f60adab3dc
|
Fixed bug in tools.py for NVidia GPUs
|
2014-10-14 13:58:28 -04:00 |
|
Philippe Tillet
|
6313f060cb
|
Cleaned model building ; added some informative commented code
|
2014-10-14 17:43:14 +02:00 |
|
Philippe Tillet
|
b8d339e54d
|
GEMM : Now performing an explicit unrolling of the loop
|
2014-10-10 21:43:59 -04:00 |
|
Philippe Tillet
|
2498c5c959
|
Fixed typo
|
2014-10-10 08:13:45 +02:00 |
|
Philippe Tillet
|
6737055d41
|
Improved occupancy calculation for AMD
|
2014-10-08 19:37:15 -04:00 |
|
Philippe Tillet
|
88f10958ba
|
Removed obsolete imports
|
2014-10-05 21:24:08 +02:00 |
|
Philippe Tillet
|
1770d3d60b
|
Removed pybrain dependency
|
2014-10-05 21:16:35 +02:00 |
|
Philippe Tillet
|
0606944d9d
|
Removed sklearn's kernel density estimator dependency
|
2014-10-05 21:15:39 +02:00 |
|
Philippe Tillet
|
2b97648e24
|
Restored reductions tuning
|
2014-10-05 06:33:50 +02:00 |
|
Philippe Tillet
|
acb7fe73e8
|
Restored VCL header generation functionality
|
2014-10-05 05:16:21 +02:00 |
|