triton

Files

Philippe Tillet 94c83d30ce [GENERAL] Removed deprecated driver files and added basic compatibility with rocm (#268 )

- Removed driver module -- accelerator runtime is handled by pytorch
- Added basic support for ROCM based on @micmelesse 's PR -- now can execute empty kernel on AMD devices without any compile-time changes
- Now only using PREFER_SHARED for kernels when the size of shared memory is greater than 49k. Otherwise there can be poor L1 performance for broadcast tensors

2021-09-09 00:04:28 -07:00

codegen

[GENERAL] Removed deprecated driver files and added basic compatibility with rocm (#268 )

2021-09-09 00:04:28 -07:00

driver

[GENERAL] Removed deprecated driver files and added basic compatibility with rocm (#268 )

2021-09-09 00:04:28 -07:00

[CODEGEN] Fixed bug in pipelining pass and casting semantics analysis (#257 )

2021-09-01 20:58:47 -07:00