Philippe Tillet
|
30833c18f1
|
[codegen/tune] bugfix in heuristics for nano-tile sizes
|
2019-05-04 01:32:34 -04:00 |
|
Philippe Tillet
|
3413aad582
|
[general] major overhaul of triton-c/triton-ir/triton-jit:
- Added alloc const
- Added atomics
- Pruning tuning space
- Added example for dot/conv/shift
- Bugfixes
|
2019-04-25 16:18:15 -04:00 |
|
Philippe Tillet
|
0c607c9392
|
[examples] normalize benchmark by max_clock / current_clock
|
2019-03-28 07:58:37 -04:00 |
|
Philippe Tillet
|
2c3ae0675e
|
[JIT] re-added nvidia compatibility
|
2019-03-27 21:12:01 -04:00 |
|
Philippe Tillet
|
fdf8559806
|
[general] added missing files
|
2019-03-27 20:01:35 -04:00 |
|
Philippe Tillet
|
bc2a257d5c
|
[code generation] more flexibility in backend selection
|
2019-03-27 11:29:42 -07:00 |
|
Philippe Tillet
|
e04253c0dd
|
[code generation] basic CPU backend
|
2019-03-27 11:13:36 -07:00 |
|
Philippe Tillet
|
8d35c98920
|
[code generation] search space pruning
|
2019-03-25 14:10:24 -07:00 |
|
Philippe Tillet
|
b73c3bdd25
|
[examples] removed dependency on isaac for auto-tuning
|
2019-03-11 22:22:43 -04:00 |
|
Philippe Tillet
|
87c85ed50d
|
[code generation] reparameterization
|
2019-03-11 19:30:21 -04:00 |
|
Philippe Tillet
|
614f83baee
|
[jit] basic auto-tuning support
|
2019-03-11 12:00:50 -04:00 |
|
Philippe Tillet
|
94e315ea8a
|
Reparameterized in terms of micro- and nano- tiles
|
2019-03-10 23:10:17 -04:00 |
|
Philippe Tillet
|
c96a263896
|
[jit] changed default metaparameter ranges
|
2019-03-10 10:45:21 -04:00 |
|
Philippe Tillet
|
9a3537662d
|
[jit] can now infer launch parameters from triton module
|
2019-03-09 14:44:13 -05:00 |
|
Philippe Tillet
|
b721202812
|
[code generation] uniformized shape and layout metaparameters
|
2019-03-09 12:31:21 -05:00 |
|
Philippe Tillet
|
5f29263044
|
[code generation] now using ir::metaparameter* for all tunable
metaparameters
|
2019-03-09 12:05:12 -05:00 |
|
Philippe Tillet
|
36acf22fd3
|
better masking
|
2019-02-28 23:46:11 -05:00 |
|
Philippe Tillet
|
daa828ec18
|
[general] rename namespace tdl -> triton
|
2019-02-24 14:35:16 -05:00 |
|
Philippe Tillet
|
6b49818282
|
[filesystem] rename tdl -> triton
|
2019-02-24 14:20:40 -05:00 |
|
Philippe Tillet
|
8f4798b81a
|
[intermediate representation] transitioning towards more flexible tile
shapes
|
2019-02-23 11:37:01 -05:00 |
|
Philippe Tillet
|
7cda55df16
|
[code generation] implements hidden operands in user (e.g., mask)
|
2019-02-21 18:00:27 -05:00 |
|
Philippe Tillet
|
5618a15dc1
|
[code generation] more bugfixes in control flow
|
2019-02-20 22:55:20 -05:00 |
|
Philippe Tillet
|
cf1a583dbf
|
bla
|
2019-02-15 22:03:09 -05:00 |
|
Philippe Tillet
|
5f5959dc6e
|
[code generation] added masked loads
|
2019-02-15 11:14:50 -05:00 |
|
Philippe Tillet
|
5aec34a094
|
[code generation] improved handling of constants
|
2019-02-06 15:02:01 -05:00 |
|
Philippe Tillet
|
e522b06be2
|
[code generation]: more progress for instruction selection
|
2019-01-26 02:05:56 -05:00 |
|
Philippe Tillet
|
7eebdceb6a
|
[code generation] fixed bug in on-the-fly AST to IR lowering
|
2019-01-23 00:11:42 -05:00 |
|
Philippe Tillet
|
80d019ec16
|
[syntax tree] added syntactic support for dereferencing
|
2019-01-10 23:53:27 -05:00 |
|
Philippe Tillet
|
b5c8c25d43
|
more debugging
|
2019-01-10 16:50:47 -05:00 |
|
Philippe Tillet
|
4f923accd7
|
[syntax tree] added basic support for range
|
2019-01-09 02:07:34 -05:00 |
|
Philippe Tillet
|
73db84c8ba
|
[syntax tree] fixed broadcast semantics lowering
|
2019-01-08 17:44:31 -05:00 |
|
Philippe Tillet
|
7a14693f51
|
[code generation] added constraints checking
|
2019-01-08 15:57:45 -05:00 |
|
Philippe Tillet
|
58757b8f10
|
[code generation] some more cleaning
|
2019-01-08 12:39:25 -05:00 |
|
Philippe Tillet
|
297d1a99d1
|
[code generation] adding missing files
|
2019-01-07 22:49:37 -05:00 |
|