Commit Graph

1237 Commits

Author SHA1 Message Date
Philippe Tillet
0b35c5ff3f [PYTHON][KERNEL] Better handling of case where cache directory already exists
2020-04-07 13:18:11 -04:00
Philippe Tillet
e04efc1c85 [GENERAL] Error messages now no longer make terminal color green 2020-04-03 23:25:29 -04:00
Philippe Tillet
7c09ff80eb [CORE] Fixed several issues that arose in the development of the torch-blocksparse package:

* Now using warp shuffle in reductions when possible
* Various bugfixes in layout inference
* Added INFINITY, exponential and select
* Better error messages for unimplemented constructs
2020-03-31 18:57:28 -04:00
Philippe Tillet
f68ed147fa Merge pull request #35 from jack-willturner/iss33
[PYTHON] Add empty string to llvm-config versions in setup.py
2020-03-13 15:06:32 -04:00
Jack Turner
53b9e4790e [PYTHON] Add empty string to llvm-config versions in setup.py 2020-03-13 18:03:25 +00:00
Philippe Tillet
699711724a [DOCS] Fixed typo: triton.function -> torch.autograd.Function 2020-03-13 11:44:55 -04:00
Philippe Tillet
7ee3380faf [DOCS] Made documentation consistent with the new kernel API 2020-03-10 13:25:57 -04:00
Philippe Tillet
2dcf381bdc [CORE][TOOLS][BENCH] Disabled normalization for auto-tuning 2020-03-06 18:41:36 -05:00
Philippe Tillet
a27b96cad1 [CORE][DRIVER] Now only using PTX6.4 if CUDA10.1+ is detected 2020-03-05 21:29:22 -05:00
Philippe Tillet
20c0246d88 [CMAKE] target_link_directories -> link_directories 2020-03-05 20:16:46 -05:00
Philippe Tillet
b7895c653f [PYTHON][EXAMPLES] Removed BlockSparse examples; see
https://github.com/ptillet/torch-blocksparse.git
2020-03-05 13:32:42 -05:00
Philippe Tillet
1f1e4ee9ec [PYTHON] Merged blocksparse branch:
* Example for blocksparse matrix multiplication
* Simplified Triton kernel API
* Revived auto-tuning in einsum
2020-03-05 13:08:07 -05:00
Philippe Tillet
51bc244b2c [PYTHON][OPS] Fixed typo in einsum 2020-02-25 16:39:04 -05:00
Philippe Tillet
cc6892f9f4 [CMAKE] Fixed issue in LLVM link directory 2020-02-25 12:52:23 -08:00
Philippe Tillet
3ee342d056 [TRITON][NN][CONV] Renamed input -> x to not modify built-in functions 2020-02-25 10:56:39 -05:00
Philippe Tillet
1e76bb4145 [PYTHON][NN][CONV] Fixed typo in dx computation 2020-02-24 22:45:49 -05:00
Philippe Tillet
cc094936d0 [PYTHON][SETUP] Removed obsolete debug print() 2020-02-24 21:35:45 -05:00
Philippe Tillet
d0ca849be2 [DOCUMENTATION] Updated installation instructions in README.md 2020-02-24 18:07:25 -05:00
Philippe Tillet
a59c24ab67 [CORE] Added missing sha1.hpp file 2020-02-24 18:02:18 -05:00
Philippe Tillet
8c35bd775f [PYTHON] Added missing files for nn submodule 2020-02-24 17:58:24 -05:00
Philippe Tillet
67c633aa2d [PYTHON] Better packaging 2020-02-24 17:46:20 -05:00
Philippe Tillet
f2daff85d2 [GENERAL] Improved caching mechanism:
* Now computing hash in libtriton
* Now only compiling a single pytorch hook per function signature
2020-02-24 16:36:50 -05:00
Philippe Tillet
29c38b38e5 [PYTHON][OPS][EINSUM] Now throwing error for automatic differentiation of extended einsum
2020-02-20 17:29:01 -05:00
Philippe Tillet
9693fe1441 [PYTHON][OP][EINSUM] simplified API 2020-02-19 23:42:22 -05:00
Philippe Tillet
c30c4861be [PYTHON][OPS][EINSUM] Added support for inner tensor strides 2020-02-19 11:50:17 -05:00
Philippe Tillet
7621aeda3f [CODEGEN][TRANSFORM][PEEPHOLE] Fixed bug in *1 multiplication 2020-02-19 00:18:55 -05:00
Philippe Tillet
304b003969 [PYTHON][EXAMPLES] Removed obsolete files 2020-02-18 12:26:06 -05:00
Philippe Tillet
d11d2db6ee [PYTHON][EINSUM] Now handling reduction sizes that are not a multiple of TK
2020-02-17 13:52:58 -05:00
Philippe Tillet
fcdc65ffb0 [PYTHON][OPS][EINSUM] Added support for masked accumulator 2020-02-13 18:11:12 -05:00
Philippe Tillet
ae462e4fa1 [DOCS][TUTORIALS] Fixed typo 2020-02-10 04:22:45 -05:00
Philippe Tillet
6a4d42c1b8 [PYTHON][CORE] Deprecating Tensorflow support 2020-02-10 04:20:33 -05:00
Philippe Tillet
7c37258549 [DOCS] Fixed README.md 2020-02-10 03:47:49 -05:00
Philippe Tillet
0493863ee9 [DOC] Fixed formatting issues in tutorial 2020-02-10 03:27:36 -05:00
Philippe Tillet
868922e9b0 [DOC] Added more tutorials 2020-02-10 03:18:23 -05:00
Philippe Tillet
56330f1fdf [DOC] Basic Sphinx skeleton 2020-02-10 01:56:45 -05:00
Philippe Tillet
a099c6f7f3 [TRITON][LANG] Added support for bitcast 2020-02-09 20:11:13 -05:00
Philippe Tillet
0767c27b3b [README] Added comment about ISAAC 2020-02-06 14:09:09 -05:00
Philippe Tillet
4ff0205275 [GENERAL] Updated README.md 2020-02-06 13:59:42 -05:00
Philippe Tillet
5a3c30148e [PYTHON][EXAMPLES] Changed shape of einsum examples 2020-02-06 13:57:30 -05:00
Philippe Tillet
e6fc28050a [GENERAL] Updated README.md 2020-02-06 13:20:25 -05:00
Philippe Tillet
48a4dc172f [GENERAL] ISAAC -> Triton 2020-02-06 00:49:18 -05:00
Philippe Tillet
fa29e63838 [GENERAL] Deleted ISAAC Files 2020-02-06 00:48:45 -05:00
Philippe Tillet
3e92901bd5 [TRITON][PYTHON] Cleaned up API 2020-02-05 19:44:19 -05:00
Philippe Tillet
2fcf5cec5b [TRITON][CODEGEN] Fixed flawed assert() 2020-01-24 15:25:00 -05:00
Philippe Tillet
db941161ed [PYTHON][EXAMPLES] Cleaned self-attention benchmarks 2020-01-22 18:09:00 -05:00
Philippe Tillet
ce7a00674a [PYTHON][EXAMPLES] Added self-attention example using triton.ops.einsum 2020-01-21 16:45:04 -05:00
Philippe Tillet
78b98fb7cf [GENERAL] Cleaned polymorphic structure of layouts analysis pass 2020-01-21 11:38:39 -05:00
Philippe Tillet
382ca2c745 [CODEGEN][ANALYSIS] cleaning: moving towards better polymorphism for tile layouts
2020-01-20 12:43:04 -05:00
Philippe Tillet
fbf2a3f56f [CODEGEN][TRANSFORM] some bug-fixes for FP32 einsum 2020-01-20 12:42:53 -05:00
Philippe Tillet
f278d9741a [GENERAL] Merged einsum feature branch. Various feature, performance improvements and bugfixes:

* Added preliminary support for extended Einstein summation in PyTriton
* Significant performance improvement on FP32 kernels containing matrix multiplication
* Added re-coalescing pass for FP16 kernels containing matrix multiplication
* Various bugfixes
2020-01-20 12:42:48 -05:00