Philippe Tillet
0b35c5ff3f
[PYTHON][KERNEL] Better handling of case where cache directory already
...
exists
2020-04-07 13:18:11 -04:00
Philippe Tillet
e04efc1c85
[GENERAL] Error messages now no longer make terminal color green
2020-04-03 23:25:29 -04:00
Philippe Tillet
7c09ff80eb
[CORE] Fixed several issues that arose in the development of the
...
torch-blocksparse package:
* Now using warp shuffle in reductions when possible
* Various bugfixes in layout inference
* Added INFINITY, exponential and select
* Better error messages for unimplemented constructs
2020-03-31 18:57:28 -04:00
Phillippe Tillet
f68ed147fa
Merge pull request #35 from jack-willturner/iss33
...
[PYTHON] Add empty string to llvm-config versions in setup.py
2020-03-13 15:06:32 -04:00
Jack Turner
53b9e4790e
[PYTHON] Add empty string to llvm-config versions in setup.py
2020-03-13 18:03:25 +00:00
Philippe Tillet
699711724a
[DOCS] Fixed typo: triton.function -> torch.autograd.Function
2020-03-13 11:44:55 -04:00
Philippe Tillet
7ee3380faf
[DOCS] Made documentation consistent with the new kernel API
2020-03-10 13:25:57 -04:00
Philippe Tillet
2dcf381bdc
[CORE][TOOLS][BENCH] Disabled normalization for auto-tuning
2020-03-06 18:41:36 -05:00
Philippe Tillet
a27b96cad1
[CORE][DRIVER] Now only using PTX6.4 if CUDA10.1+ is detected
2020-03-05 21:29:22 -05:00
Philippe Tillet
20c0246d88
[CMAKE] target_link_directories -> link_directories
2020-03-05 20:16:46 -05:00
Philippe Tillet
b7895c653f
[PYTHON][EXAMPLES] Removed BlockSparse examples; see
...
https://github.com/ptillet/torch-blocksparse.git
2020-03-05 13:32:42 -05:00
Philippe Tillet
1f1e4ee9ec
[PYTHON] Merged blocksparse branch:
...
* Example for blocksparse matrix multiplication
* Simplified Triton kernel API
* Revived auto-tuning in einsum
2020-03-05 13:08:07 -05:00
Philippe Tillet
51bc244b2c
[PYTHON][OPS] Fixed typo in einsum
2020-02-25 16:39:04 -05:00
Philippe Tillet
cc6892f9f4
[CMAKE] Fixed issue in LLVM link directory
2020-02-25 12:52:23 -08:00
Philippe Tillet
3ee342d056
[TRITON][NN][CONV] Renamed input -> x to not modify built-in functions
2020-02-25 10:56:39 -05:00
Philippe Tillet
1e76bb4145
[PYTHON][NN][CONV] Fixed typo in dx computation
2020-02-24 22:45:49 -05:00
Philippe Tillet
cc094936d0
[PYTHON][SETUP] Removed obsolete debug print()
2020-02-24 21:35:45 -05:00
Philippe Tillet
d0ca849be2
[DOCUMENTATION] Updated installation instructions in README.md
2020-02-24 18:07:25 -05:00
Philippe Tillet
a59c24ab67
[CORE] Added missing sha1.hpp file
2020-02-24 18:02:18 -05:00
Philippe Tillet
8c35bd775f
[PYTHON] Added missing files for nn submodule
2020-02-24 17:58:24 -05:00
Philippe Tillet
67c633aa2d
[PYTHON] Better packaging
2020-02-24 17:46:20 -05:00
Philippe Tillet
f2daff85d2
[GENERAL] Improved caching mechanism:
...
* Now computing hash in libtriton
* Now only compiling a single pytorch hook per function signature
2020-02-24 16:36:50 -05:00
Philippe Tillet
29c38b38e5
[PYTHON][OPS][EINSUM] Now throwing error for automatic differentiation
...
of extended einsum
2020-02-20 17:29:01 -05:00
Philippe Tillet
9693fe1441
[PYTHON][OP][EINSUM] simplified API
2020-02-19 23:42:22 -05:00
Philippe Tillet
c30c4861be
[PYTHON][OPS][EINSUM] Added support for inner tensor strides
2020-02-19 11:50:17 -05:00
Philippe Tillet
7621aeda3f
[CODEGEN][TRANSFORM][PEEPHOLE] Fixed bug in *1 multiplication
2020-02-19 00:18:55 -05:00
Philippe Tillet
304b003969
[PYTHON][EXAMPLES] Removed obsolete files
2020-02-18 12:26:06 -05:00
Philippe Tillet
d11d2db6ee
[PYTHON][EINSUM] Now handling reduction sizes that are not a multiple of
...
TK
2020-02-17 13:52:58 -05:00
Philippe Tillet
fcdc65ffb0
[PYTHON][OPS][EINSUM] Added support for masked accumulator
2020-02-13 18:11:12 -05:00
Philippe Tillet
ae462e4fa1
[DOCS][TUTORIALS] Fixed typo
2020-02-10 04:22:45 -05:00
Philippe Tillet
6a4d42c1b8
[PYTHON][CORE] Deprecating Tensorflow support
2020-02-10 04:20:33 -05:00
Philippe Tillet
7c37258549
[DOCS] Fixed README.md
2020-02-10 03:47:49 -05:00
Philippe Tillet
0493863ee9
[DOC] Fixed formatting issues in tutorial
2020-02-10 03:27:36 -05:00
Philippe Tillet
868922e9b0
[DOC] Added more tutorials
2020-02-10 03:18:23 -05:00
Philippe Tillet
56330f1fdf
[DOC] Basic Sphynx skeleton
2020-02-10 01:56:45 -05:00
Philippe Tillet
a099c6f7f3
[TRITON][LANG] Added support for bitcast
2020-02-09 20:11:13 -05:00
Philippe Tillet
0767c27b3b
[README] Added comment about ISAAC
2020-02-06 14:09:09 -05:00
Philippe Tillet
4ff0205275
[GENERAL] Updated README.md
2020-02-06 13:59:42 -05:00
Philippe Tillet
5a3c30148e
[PYTHON][EXAMPLES] Changed shape of einsum examples
2020-02-06 13:57:30 -05:00
Philippe Tillet
e6fc28050a
[GENERAL] Updated README.md
2020-02-06 13:20:25 -05:00
Philippe Tillet
48a4dc172f
[GENERAL] ISAAC -> Triton
2020-02-06 00:49:18 -05:00
Philippe Tillet
fa29e63838
[GENERAL] Deleted ISAAC Files
2020-02-06 00:48:45 -05:00
Philippe Tillet
3e92901bd5
[TRITON][PYTHON] Cleaned up API
2020-02-05 19:44:19 -05:00
Philippe Tillet
2fcf5cec5b
[TRITON][CODEGEN] Fixed flawed assert()
2020-01-24 15:25:00 -05:00
Philippe Tillet
db941161ed
[PYTHON][EXAMPLES] Cleaned self-attention benchmarks
2020-01-22 18:09:00 -05:00
Philippe Tillet
ce7a00674a
[PYTHON][EXAMPLES] Added self-attention example using triton.ops.einsum
2020-01-21 16:45:04 -05:00
Philippe Tillet
78b98fb7cf
[GENERAL] Cleaned polymorphic structure of layouts analysis pass
2020-01-21 11:38:39 -05:00
Philippe Tillet
382ca2c745
[CODEGEN][ANALYSIS] cleaning: moving towards better polymorphism for
...
tile layouts
2020-01-20 12:43:04 -05:00
Philippe Tillet
fbf2a3f56f
[CODEGEN][TRANSFORM] some bug-fixes for FP32 einsum
2020-01-20 12:42:53 -05:00
Philippe Tillet
f278d9741a
[GENERAL] Merged einsum feature branch. Various feature, performance
...
improvements and bugfixes:
* Added preliminary support for extended Einstein summation in PyTriton
* Significant performance improvement on FP32 kernels containing matrix
multiplication
* Added re-coalescing pass for FP16 kernels containing matrix
multiplication
* Various bugfixes
2020-01-20 12:42:48 -05:00