Commit Graph

459 Commits

Author SHA1 Message Date
Philippe Tillet
64eaec016f [Version] Now version 0.2.3 2021-07-27 12:38:48 -07:00
Philippe Tillet
db4e4b9dbf [VERSION] Now version 0.2.2 2021-07-27 12:38:48 -07:00
Philippe Tillet
7af9d812cf [PYTHON] Added credits to Scott Gray for the idea used in launch.cc 2021-07-27 12:38:48 -07:00
Philippe Tillet
acff1b5e05 [RUNTIME] Lower-level interface for executing functions 2021-07-27 12:38:48 -07:00
Philippe Tillet
ba9955ae39 [CODEGEN][ANALYSIS] Fixed issue in layout inference 2021-07-27 12:38:48 -07:00
Philippe Tillet
89e456107b [EXAMPLES] Improved mat_mul example 2021-07-27 12:38:48 -07:00
Philippe Tillet
68c18238a9 [EXAMPLES] Added conv2d example 2021-07-27 12:38:48 -07:00
Philippe Tillet
46297a949f [PACKAGING] Now version 0.2.1 2021-07-27 12:38:48 -07:00
Philippe Tillet
c251dc50f3 [PACKAGING] Now version 0.2.0 2021-07-27 12:38:48 -07:00
Philippe Tillet
4ccd78f1a6 [EXAMPLES][TUTORIAL] Changed to new triton.kernel API 2021-07-27 12:38:48 -07:00
Philippe Tillet
c33d6d15f5 [TRITON][PYTHON] Reverted back to distutils 2021-07-27 12:38:48 -07:00
Philippe Tillet
955b027103 [TRITON][KERNEL] Fixed issue for concurrent compilation of torch
extensions
2021-07-27 12:38:48 -07:00
Philippe Tillet
d85141182d [PACKAGING] Now version 0.1.3 2021-07-27 12:38:48 -07:00
Philippe Tillet
5995cbff8e [CORE] Auto-tuning now copies scalar buffers. Still needs to copy all buffers that are both read from and written to. 2021-07-27 12:38:48 -07:00
Philippe Tillet
78cd54b0c8 [PYTHON] Added support for FP16 scalar kernel arguments 2021-07-27 12:38:48 -07:00
Philippe Tillet
694bfbddf9 [PACKAGING] Now version 0.1.2 2021-07-27 12:38:48 -07:00
Philippe Tillet
13ff6472e0 [LANG] Fixed undefined behavior in replace_all_uses_with() 2021-07-27 12:38:48 -07:00
Philippe Tillet
f35b9100e2 [PYTHON] Restored compatibility with powerpc 2021-07-27 12:38:48 -07:00
Philippe Tillet
1426b103e9 [PYTHON] Removed -std=gnu++11 in extra_cflags 2021-07-27 12:38:48 -07:00
Philippe Tillet
04a9ea060b [GENERAL] Added compatibility with pytorch 1.2.0 and powerpc 2021-07-27 12:38:48 -07:00
jack-willturner
180ed26b61 [DOCS] Transposition fix 2021-07-27 12:38:48 -07:00
jack-willturner
0920da6fae Merge https://github.com/ptillet/triton 2021-07-27 12:38:48 -07:00
jack-willturner
a98a2db2c2 [DOCS] Matrix copy and transpose 2021-07-27 12:38:48 -07:00
Philippe Tillet
609ef3a24d [CORE] Fixed bug for Multi-GPU 2021-07-27 12:38:48 -07:00
jack-willturner
32819dea51 [DOCS] Matmul and vecadd working examples 2021-07-27 12:38:48 -07:00
Philippe Tillet
24586e60aa [PACKAGING] sdist now generates working .tar.gz file 2021-07-27 12:38:48 -07:00
Philippe Tillet
ce4a4728f5 [PACKAGING] Fixed typo in setup.py 2021-07-27 12:38:48 -07:00
Philippe Tillet
769c1180c5 [PACKAGING] Fixed import error 2021-07-27 12:38:48 -07:00
Philippe Tillet
3709f564e1 [PACKAGING] Added some more files for packaging 2021-07-27 12:38:48 -07:00
Philippe Tillet
435acbf585 [PACKAGING] Added MANIFEST.in and some symlinks for better packaging 2021-07-27 12:38:48 -07:00
Philippe Tillet
f805ff278a [PYTHON][SRC][BINDING] Improved code portability across compilers 2021-07-27 12:38:48 -07:00
Philippe Tillet
c36ad6bf8a [PYTHON][EXAMPLES][EINSUM] Updated configs for matmul 2021-07-27 12:38:48 -07:00
Philippe Tillet
7924642b78 [PYTHON][EXAMPLES][EINSUM] Added stride in CONV2D example 2021-07-27 12:38:48 -07:00
Philippe Tillet
f22ad0064c [PYTHON][EXAMPLES][EINSUM] Added group-convolution test/benchmark 2021-07-27 12:38:48 -07:00
Philippe Tillet
5bb977173f [PYTHON][EINSUM] re-established auto-tuning 2021-07-27 12:38:48 -07:00
Philippe Tillet
4ae0e28b32 [PYTHON][KERNEL] Added thread-safety when caching custom torch op 2021-07-27 12:38:48 -07:00
Philippe Tillet
94e8ee7f01 [PYTHON][KERNEL] Better handling of case where cache directory already
exists
2021-07-27 12:38:48 -07:00
Philippe Tillet
5943baa53f [GENERAL] Error messages now no longer make terminal color green 2021-07-27 12:38:48 -07:00
Philippe Tillet
3304629de9 [CORE] Fixed several issues that arose in the development of the
torch-blocksparse package:

* Now using warp shuffle in reductions when possible
* Various bugfixes in layout inference
* Added INFINITY, exponential and select
* Better error messages for unimplemented constructs
2021-07-27 12:38:48 -07:00
Jack Turner
33d7619482 [PYTHON] Add empty string to llvm-config versions in setup.py 2021-07-27 12:38:48 -07:00
Philippe Tillet
9fda39f64c [PYTHON][EXAMPLES] Removed BlockSparse examples; see
https://github.com/ptillet/torch-blocksparse.git
2021-07-27 12:38:48 -07:00
Philippe Tillet
268894a5ce [PYTHON] Merged blocksparse branch:
* Example for blocksparse matrix multiplication
* Simplified Triton kernel API
* Revived auto-tuning in einsum
2021-07-27 12:38:48 -07:00
Philippe Tillet
ea37ba5d35 [PYTHON][OPS] Fixed typo in einsum 2021-07-27 12:38:48 -07:00
Philippe Tillet
926acc2e28 [TRITON][NN][CONV] Renamed input -> x to not modify built-in functions 2021-07-27 12:38:48 -07:00
Philippe Tillet
420e36a038 [PYTHON][NN][CONV] Fixed typo in dx computation 2021-07-27 12:38:48 -07:00
Philippe Tillet
01154f24db [PYTHON][SETUP] Removed obsolete debug print() 2021-07-27 12:38:48 -07:00
Philippe Tillet
ecb0d81b2d [PYTHON] Added missing files for nn submodule 2021-07-27 12:38:48 -07:00
Philippe Tillet
3d769b57e2 [PYTHON] Better packaging 2021-07-27 12:38:48 -07:00
Philippe Tillet
dfb844bf41 [GENERAL] Improved caching mechanism:
* Now computing hash in libtriton
* Now only compiling a single pytorch hook per function signature
2021-07-27 12:38:48 -07:00
Philippe Tillet
30f77e9ec5 [PYTHON][OPS][EINSUM] Now throwing error for automatic differentiation
of extended einsum
2021-07-27 12:38:48 -07:00