Philippe Tillet
|
1783d45bef
|
[codegen] better handling of row/column-major
|
2019-10-04 16:07:31 -04:00 |
|
Philippe Tillet
|
a1e0512703
|
[codegen] more progress
|
2019-10-03 14:11:50 -04:00 |
|
Philippe Tillet
|
86a3e5d897
|
[codegen] now matrix-multiplication is bank-conflict free for all layouts
|
2019-10-01 16:57:59 -04:00 |
|
Philippe Tillet
|
ed1b2bc563
|
more work on padding
|
2019-09-27 22:15:30 -04:00 |
|
Philippe Tillet
|
575dd06be3
|
[codegen] more progress towards unified dot implementation
|
2019-09-26 14:01:28 -04:00 |
|
Philippe Tillet
|
001973630e
|
[codegen] cleaned up shared memory and double-buffering logic
|
2019-09-21 22:21:40 -04:00 |
|
Philippe Tillet
|
43d88154bd
|
[codegen] cleaning-up / formalizing shared-memory passes
|
2019-09-20 16:01:12 -04:00 |
|
Philippe Tillet
|
8ae779206f
|
more fixes
|
2019-09-14 02:36:11 -04:00 |
|
Philippe Tillet
|
a842d337c5
|
[general] various cleaning and bugfix:
* added copy1d and copy2d benchmark
* fixed issue in reassociation pass
|
2019-09-02 23:00:49 -04:00 |
|