Commit Graph

289 Commits

Author SHA1 Message Date
Philippe Tillet
6595d20c34 Fixed over-head bug in the auto-tuner (not in the benchmarks) 2014-11-06 16:15:22 -05:00
Philippe Tillet
8dd492de23 Now using ViennaCL for reference instead of "naive" 2014-11-06 16:15:22 -05:00
Karl Rupp
71c72970f6 PyATIDLAS: Added architecture for Maxwell (sm_50) 2014-11-06 13:57:54 +01:00
Philippe Tillet
6049f0dfed Bench: Removed OpenCL with CUBLAS 2014-11-06 07:21:27 -05:00
Philippe Tillet
0b18a48eff Models: Bugfix... 2014-11-06 07:08:36 -05:00
Philippe Tillet
34998afee2 Templates/Reduction : Removed warnings 2014-11-06 07:08:09 -05:00
Philippe Tillet
146f54b32c Improved benchmark suite 2014-11-06 07:07:27 -05:00
Philippe Tillet
2be82fc272 Added support for Devastator and Scrapper 2014-11-02 11:28:05 -05:00
Philippe Tillet
f892d4d85d Added pyviennacl as a submodule 2014-11-02 11:05:19 -05:00
Philippe Tillet
6ca682e5eb Added include-guards in viennacl headers generation ; bugfix in autotune.py's operation parsing 2014-11-02 10:31:27 -05:00
Philippe Tillet
0ae87eeff4 Added viennacl-src-path in the UI 2014-11-02 10:05:14 -05:00
Philippe Tillet
45d602bfc8 Better UI 2014-10-31 18:56:33 -04:00
Philippe Tillet
89f3e1d211 Better GA initialization 2014-10-31 18:12:55 -04:00
Philippe Tillet
bdeb18429b Added CuBlas benchmark 2014-10-30 20:42:23 -04:00
Philippe Tillet
e576d12d66 Added ViennaCL time 2014-10-30 14:20:14 -04:00
Philippe Tillet
f0a4255daa Now actually benchmarking axpy 2014-10-30 14:11:50 -04:00
Philippe Tillet
32fa1e2b7c Now gnuplot script exports to file 2014-10-30 13:20:59 -04:00
Philippe Tillet
71b4bde3ec Better benchmarking 2014-10-30 13:04:33 -04:00
Philippe Tillet
de48ccc7b1 GEMM now specified in terms of MNK rather than MKN 2014-10-29 23:52:27 -04:00
Philippe Tillet
a8531cac37 Fixed bug in AXPY ; added the possibility to update a .json file 2014-10-29 23:38:22 -04:00
Philippe Tillet
b46f26e54c Auto-tuner: More flexibility 2014-10-29 12:45:20 -04:00
Philippe Tillet
086e51d291 More logical ENUM creation 2014-10-29 17:03:35 +01:00
Philippe Tillet
83bce648a0 Now using the median for benchmarking 2014-10-29 17:03:24 +01:00
Philippe Tillet
4712e3f235 Fixed import bug 2014-10-29 17:02:57 +01:00
Philippe Tillet
7707eada5e Fixed GEMM bug 2014-10-29 17:02:47 +01:00
Philippe Tillet
9803bc8c92 Enhancements of the auto-tuner 2014-10-29 17:01:57 +01:00
Philippe Tillet
199badc689 Cleaned some weird interaction between pyviennacl and pyatidlas 2014-10-28 02:43:42 -04:00
Philippe Tillet
9a5627e20f Prettier command line stuff 2014-10-28 01:10:14 -04:00
Philippe Tillet
1bc7d50258 Now everything is entirely handled through the command line 2014-10-27 22:22:07 -04:00
Philippe Tillet
740ba08c5c Added basic BLAS1 benchmark 2014-10-27 05:35:04 -04:00
Philippe Tillet
ba50960b0f Python/Autotune: Moved devices from config.ini to command line argument 2014-10-27 03:28:46 -04:00
Philippe Tillet
7780423fa1 C++: Slight modifications on tests 2014-10-27 03:28:03 -04:00
Philippe Tillet
ec24a29aa1 C++: Added possibility to override the model 2014-10-27 03:27:43 -04:00
Philippe Tillet
ee131be534 C++: Basic model handling 2014-10-27 01:42:11 -04:00
Philippe Tillet
11c283590f Replaced cxfreeze with pyinstaller. Works better. 2014-10-16 17:49:17 -04:00
Philippe Tillet
e0f0400a55 Config: Added SONAME for pyatidlas 2014-10-16 09:59:37 -04:00
Philippe Tillet
16c872f38d More packaging ... 2014-10-16 06:57:38 -04:00
Philippe Tillet
d4eb3ed820 Now correctly linking to pyviennacl 2014-10-15 14:21:00 -04:00
Philippe Tillet
30089b7fa6 ATIDLAS's python wrapper is now stand-alone 2014-10-15 05:24:19 -04:00
Philippe Tillet
f91d3b422a Now compiling ATIDLAS 2014-10-14 23:50:35 -04:00
Philippe Tillet
f60adab3dc Fixed bug in tools.py for NVidia GPUs 2014-10-14 13:58:28 -04:00
Philippe Tillet
6313f060cb Cleaned model building ; added some informative commented code 2014-10-14 17:43:14 +02:00
Philippe Tillet
b8d339e54d GEMM : Now performing an explicit unrolling of the loop 2014-10-10 21:43:59 -04:00
Philippe Tillet
2498c5c959 Fixed typo 2014-10-10 08:13:45 +02:00
Philippe Tillet
6737055d41 Improved occupancy calculation for AMD 2014-10-08 19:37:15 -04:00
Philippe Tillet
88f10958ba Removed obsolete imports 2014-10-05 21:24:08 +02:00
Philippe Tillet
1770d3d60b Removed pybrain dependency 2014-10-05 21:16:35 +02:00
Philippe Tillet
0606944d9d Removed sklearn's kernel density estimator dependency 2014-10-05 21:15:39 +02:00
Philippe Tillet
2b97648e24 Restored reductions tuning 2014-10-05 06:33:50 +02:00
Philippe Tillet
acb7fe73e8 Restored VCL header generation functionnality 2014-10-05 05:16:21 +02:00