triton

Files

Philippe Tillet 94c83d30ce [GENERAL] Removed deprecated driver files and added basic compatibility with rocm (#268 )

- Removed driver module -- accelerator runtime is handled by pytorch
- Added basic support for ROCM based on @micmelesse 's PR -- now can execute empty kernel on AMD devices without any compile-time changes
- Now only using PREFER_SHARED for kernels when the size of shared memory is greater than 49k. Otherwise there can be poor L1 performance for broadcast tensors

2021-09-09 00:04:28 -07:00

01-vector-add.py

[GENERAL] Removed deprecated driver files and added basic compatibility with rocm (#268 )

2021-09-09 00:04:28 -07:00

02-fused-softmax.py

[DOCS] softmax tutorial fixup (#198 )

2021-08-11 17:35:00 -07:00

03-matrix-multiplication.py

[DOCS] Various improvements (#224 )

2021-08-18 11:15:53 -07:00

04-low-memory-dropout.py

[LANG] Added seeded random number generation - philox (#261 )

2021-09-02 22:02:40 -07:00

README.rst

[DOCS] Re-structured documentation hierarchy

2021-07-27 12:38:49 -07:00

README.rst

Tutorials
==================

Below is a gallery of tutorials for writing various basic operations with Triton. It is recommended that you read through the tutorials in order, starting with the simplest one.