Philippe Tillet
|
7d50b87681
|
[selection/codegen] bugfix in distributed tile indices initialization
|
2019-06-11 10:45:19 -07:00 |
|
Philippe Tillet
|
06b5992509
|
[feature] added basic tensor core support
|
2019-06-11 10:24:49 -07:00 |
|
Philippe Tillet
|
5f3d48c1d0
|
[tensor cores] added basic codegen template for using wmma
|
2019-06-07 21:19:47 -07:00 |
|
Philippe Tillet
|
ec4c6aaaaa
|
Added inline PTX for mma.sync
|
2019-06-07 19:39:33 -07:00 |
|
Philippe Tillet
|
f58c9a4d2b
|
[general] hmma baseline setup
|
2019-06-05 14:43:38 -07:00 |
|
Philippe Tillet
|
b2b55c52c9
|
[triton/python/conv]: Added cache for compiled kernels
|
2019-05-18 11:51:49 -04:00 |
|
Philippe Tillet
|
7b6efc0463
|
[examples/python/tensorflow] bugfix in tensorflow wrapper example
|
2019-04-30 21:04:30 -04:00 |
|
Philippe Tillet
|
af58b8bd81
|
[triton-c] predicate in assignment statement now propagates to rhs
computations
|
2019-04-27 14:00:15 -04:00 |
|
Philippe Tillet
|
4b77b764ba
|
[triton-c] added support for while loops
|
2019-04-26 15:08:02 -04:00 |
|
Philippe Tillet
|
3413aad582
|
[general] major overhaul of triton-c/triton-ir/triton-jit:
- Added alloc const
- Added atomics
- Pruning tuning space
- Added example for dot/conv/shift
- Bugfixes
|
2019-04-25 16:18:15 -04:00 |
|
Philippe Tillet
|
0c607c9392
|
[examples] normalize benchmark by max_clock / current_clock
|
2019-03-28 07:58:37 -04:00 |
|
Philippe Tillet
|
2c3ae0675e
|
[JIT] re-added nvidia compatibility
|
2019-03-27 21:12:01 -04:00 |
|
Philippe Tillet
|
bc2a257d5c
|
[code generation] more flexibility in backend selection
|
2019-03-27 11:29:42 -07:00 |
|
Philippe Tillet
|
e04253c0dd
|
[code generation] basic CPU backend
|
2019-03-27 11:13:36 -07:00 |
|
Philippe Tillet
|
9d6fc1c051
|
[code generation] bugfix in single buffering
|
2019-03-26 15:55:48 -07:00 |
|
Philippe Tillet
|
8d35c98920
|
[code generation] search space pruning
|
2019-03-25 14:10:24 -07:00 |
|
Philippe Tillet
|
deb7a1cc5c
|
Hack to make OpenCL for AMD work
|
2019-03-23 18:58:25 -07:00 |
|
Philippe Tillet
|
be55b3a081
|
saving progress
|
2019-03-23 16:52:53 -07:00 |
|
Philippe Tillet
|
9de9feff4a
|
[jit] added runtime for host but compilation still needs to be implemented
|
2019-03-23 13:40:42 -07:00 |
|
Philippe Tillet
|
49fd6ece99
|
some cleaning
|
2019-03-21 23:51:47 -07:00 |
|
Philippe Tillet
|
907bbb1ad2
|
[driver] now debugging AMD runtime
|
2019-03-20 17:32:17 -07:00 |
|
Philippe Tillet
|
b6305f4388
|
[driver] added more genericity for opencl support
|
2019-03-20 00:25:08 -07:00 |
|
Philippe Tillet
|
02775a226e
|
[driver] adding opencl in the driver API
|
2019-03-18 23:12:14 -07:00 |
|
Philippe Tillet
|
d2e7d7890d
|
[jit] preparing auto-tuning
|
2019-03-10 00:42:36 -05:00 |
|
Philippe Tillet
|
d049679aa2
|
[general] added simple jit interface
|
2019-03-08 23:58:42 -05:00 |
|
Philippe Tillet
|
daa828ec18
|
[general] rename namespace tdl -> triton
|
2019-02-24 14:35:16 -05:00 |
|
Philippe Tillet
|
6b49818282
|
[filesystem] rename tdl -> triton
|
2019-02-24 14:20:40 -05:00 |
|
Philippe Tillet
|
f697fcb887
|
[driver] fixed some bugs
|
2019-02-08 18:05:43 -05:00 |
|
Philippe Tillet
|
a9d219cdf5
|
[driver] added driver source code from isaac repository
|
2019-02-08 14:47:56 -05:00 |
|