Go to file

Philippe Tillet 2b9355c9e4 [PYTHON][TENSORFLOW] Got rid of alloc_empty entirely; now doing

generating allocation code inside the tensorflow op

2019-10-30 01:38:30 -04:00

cmake

[cmake] better FindLLVM

2019-09-05 17:48:29 -04:00

docs

[doc][pytriton] now showing full requirements of triton.function

2019-10-14 11:36:54 -04:00

include/triton

[PYTHON][OPS] Added batch normalization op

2019-10-29 17:29:11 -04:00

lib

[PYTHON][OPS] Added batch normalization op

2019-10-29 17:29:11 -04:00

python

[PYTHON][TENSORFLOW] Got rid of alloc_empty entirely; now doing

2019-10-30 01:38:30 -04:00

tests

[PYTHON][EINSUM] Added support for FP16

2019-10-28 14:07:17 -04:00

CMakeLists.txt

[driver] now passing std::unique_ptr<> instead of cloning LLVM module

2019-09-05 17:25:58 -04:00

LICENSE

[LICENSING] updated license to incorporate credit for wgtcc

2019-08-23 17:56:30 -07:00

README.md

[documentation] swapped the order of pytriton and triton-c tutorial in README.md

2019-09-10 21:17:22 -04:00

README.md

Triton

This is the development repository of Triton, a language and compiler for writing highly efficient custom Deep-Learning primitives.

The formal foundations of this project are described in the following MAPL2019 publication: Triton: An Intermediate Language and Compiler for Tiled Neural Network Computations. Please cite us if you use our work!

The main features of Triton at the moment are:

PyTriton: A Python API for writing custom operations for Triton-C compute-kernels. PyTriton automatically generates and just-in-time Tensorflow and PyTorch bindings.
Triton-C: An imperative, single-threaded language for writing highly efficient compute-kernels at a relatively high abstraction level using numpy-like extensions of the C language.
Triton-IR: An intermediate-representation for optimizing multi-dimensional array operations in linear algebra programs
Triton-JIT: An optimizing just-in-time compiler for Triton-C, which generates GPU code on par with state-of-the-art CUDA-C (e.g., CUTLASS) and PTX (e.g., ISAAC). This includes transparent support for mixed-precision and Tensor Cores.

Installation

Triton is a fairly self-contained package and uses its own parser (forked from wgtcc) and LLVM code-generator. However, at the moment it still relies on LLVM-8.0+ for PTX code generation.

sudo apt-get install llvm-8-dev
git clone https://github.com/ptillet/triton.git;
cd triton/python/;
python setup.py develop;
cd examples;
python dot.py

Tutorials

The PyTriton API
The Triton-C language
The Triton-IR representation (coming soon...)
The Triton-JIT compiler (coming soon...)