Philippe Tillet
|
08fcfbca47
|
[code generation] better predication
|
2019-03-01 14:36:17 -05:00 |
|
Philippe Tillet
|
36acf22fd3
|
better masking
|
2019-02-28 23:46:11 -05:00 |
|
Philippe Tillet
|
017702590b
|
[intermediate representation] added ternary_inst
|
2019-02-26 14:20:58 -05:00 |
|
Philippe Tillet
|
68dea75aa0
|
[syntax tree] more fixes in lowering phi nodes
|
2019-02-26 12:36:37 -05:00 |
|
Philippe Tillet
|
338f291835
|
[code generation] now ordered iterations across distributed tiles
|
2019-02-25 11:41:45 -05:00 |
|
Philippe Tillet
|
6dc88878ac
|
[code generation] bugfix in double-buffering
|
2019-02-24 23:22:28 -05:00 |
|
Philippe Tillet
|
daa828ec18
|
[general] rename namespace tdl -> triton
|
2019-02-24 14:35:16 -05:00 |
|
Philippe Tillet
|
6b49818282
|
[filesystem] rename tdl -> triton
|
2019-02-24 14:20:40 -05:00 |
|
Philippe Tillet
|
1b5f7f2139
|
[code generation] basic metaparameter support
|
2019-02-23 22:24:12 -05:00 |
|
Philippe Tillet
|
8f4798b81a
|
[intermediate representation] transitioning towards more flexible tile shapes
|
2019-02-23 11:37:01 -05:00 |
|
Philippe Tillet
|
7cda55df16
|
[code generation] implements hidden operands in user (e.g., mask)
|
2019-02-21 18:00:27 -05:00 |
|
Philippe Tillet
|
5618a15dc1
|
[code generation] more bugfixes in control flow
|
2019-02-20 22:55:20 -05:00 |
|
Philippe Tillet
|
90ec0ae2c0
|
[code generation] some more bugfixing with nested control flow
|
2019-02-18 22:54:08 -05:00 |
|
Philippe Tillet
|
cf1a583dbf
|
bla
|
2019-02-15 22:03:09 -05:00 |
|
Philippe Tillet
|
5f5959dc6e
|
[code generation] added masked loads
|
2019-02-15 11:14:50 -05:00 |
|
Philippe Tillet
|
32562677e9
|
[code generation] added barriers placement
|
2019-02-12 19:36:16 -05:00 |
|
Philippe Tillet
|
41aad4800c
|
[code generation] added double-buffering
|
2019-02-12 11:47:52 -05:00 |
|
Philippe Tillet
|
e45d6bbb60
|
some cleaning
|
2019-02-12 11:00:24 -05:00 |
|
Philippe Tillet
|
f8e522ada8
|
blabla
|
2019-02-11 17:27:16 -05:00 |
|
Philippe Tillet
|
b2e487491f
|
[code generation] now vectorizing shared memory stores
|
2019-02-10 21:59:41 -05:00 |
|
Philippe Tillet
|
8ab5ca3de3
|
blabla
|
2019-02-10 20:41:07 -05:00 |
|
Philippe Tillet
|
3d07e909c6
|
attempting vectorization
|
2019-02-10 18:29:25 -05:00 |
|
Philippe Tillet
|
4a0736ce20
|
[code generation] in-place CSE in shared memory reads
|
2019-02-09 23:56:53 -05:00 |
|
Philippe Tillet
|
d39f97ef38
|
[code generation] simple matrix-multiplication working
|
2019-02-09 19:20:50 -05:00 |
|
Philippe Tillet
|
77dd99efe8
|
[code generation] bug fixes in grid axes binding
|
2019-02-08 23:32:17 -05:00 |
|
Philippe Tillet
|
90c0474974
|
[examples] improved template for testing matrix multiplication
|
2019-02-08 12:54:20 -05:00 |
|
Philippe Tillet
|
dd35277858
|
[examples] added basic skeleton to generate matrix multiplication PTX
|
2019-02-07 22:42:54 -05:00 |
|
Philippe Tillet
|
1b9a7a8e97
|
[code generation] added basic shared copy/read
|
2019-02-07 17:03:19 -05:00 |
|
Philippe Tillet
|
5fdb27d9ae
|
[code generation] fixed bug in tile phi nodes
|
2019-02-06 23:34:45 -05:00 |
|
Philippe Tillet
|
53aca3fa89
|
[code generation] fixed bugs in tile instructions lowering
|
2019-02-06 17:30:33 -05:00 |
|
Philippe Tillet
|
4490061950
|
test
|
2019-02-06 17:21:07 -05:00 |
|
Philippe Tillet
|
5aec34a094
|
[code generation] improved handling of constants
|
2019-02-06 15:02:01 -05:00 |
|
Philippe Tillet
|
e522b06be2
|
[code generation]: more progress for instruction selection
|
2019-01-26 02:05:56 -05:00 |
|
Philippe Tillet
|
7eebdceb6a
|
[code generation] fixed bug in on-the-fly AST to IR lowering
|
2019-01-23 00:11:42 -05:00 |
|
Philippe Tillet
|
a0ecdba5a2
|
[code generation] testing analysis passes
|
2019-01-12 23:24:25 -05:00 |
|
Philippe Tillet
|
297d1a99d1
|
[code generation] adding missing files
|
2019-01-07 22:49:37 -05:00 |
|