Commit Graph

166 Commits

Author SHA1 Message Date
Philippe Tillet
f35b9100e2 [PYTHON] Restored compatibility with powerpc 2021-07-27 12:38:48 -07:00
Philippe Tillet
1426b103e9 [PYTHON] Removed -std=gnu++11 in extra_cflags 2021-07-27 12:38:48 -07:00
Philippe Tillet
04a9ea060b [GENERAL] Added compatibility with pytorch 1.2.0 and powerpc 2021-07-27 12:38:48 -07:00
Philippe Tillet
9984ee8c7a [DOCS] Added pip command in README.md 2021-07-27 12:38:48 -07:00
Philippe Tillet
32d615f8f8 [DOCS] Now specifying pip command in installation.rst 2021-07-27 12:38:48 -07:00
Phillippe Tillet
ab75fbccc0 Merge pull request #38 from jack-willturner/master
Add working examples to tutorials and python examples folder
2021-07-27 12:38:48 -07:00
Philippe Tillet
609ef3a24d [CORE] Fixed bug for Multi-GPU 2021-07-27 12:38:48 -07:00
jack-willturner
180ed26b61 [DOCS] Transposition fix 2021-07-27 12:38:48 -07:00
Philippe Tillet
24586e60aa [PACKAGING] sdist now generates working .tar.gz file 2021-07-27 12:38:48 -07:00
jack-willturner
0920da6fae Merge https://github.com/ptillet/triton 2021-07-27 12:38:48 -07:00
Philippe Tillet
769c1180c5 [PACKAGING] Fixed import error 2021-07-27 12:38:48 -07:00
jack-willturner
a98a2db2c2 [DOCS] Matrix copy and transpose 2021-07-27 12:38:48 -07:00
Philippe Tillet
435acbf585 [PACKAGING] Added MANIFEST.in and some symlinks for better packaging 2021-07-27 12:38:48 -07:00
jack-willturner
32819dea51 [DOCS] Matmul and vecadd working examples 2021-07-27 12:38:48 -07:00
Philippe Tillet
ce4a4728f5 [PACKAGING] Fixed typo in setup.py 2021-07-27 12:38:48 -07:00
Philippe Tillet
3709f564e1 [PACKAGING] Added some more files for packaging 2021-07-27 12:38:48 -07:00
Philippe Tillet
c73dee080c [CODEGEN] Fixed bug for phi nodes with constant incoming value 2021-07-27 12:38:48 -07:00
Philippe Tillet
54805596f5 [CODEGEN][ANALYSIS] bugfix in alignment analysis 2021-07-27 12:38:48 -07:00
Philippe Tillet
f805ff278a [PYTHON][SRC][BINDING] Improved code portability across compilers 2021-07-27 12:38:48 -07:00
Philippe Tillet
c36ad6bf8a [PYTHON][EXAMPLES][EINSUM] Updated configs for matmul 2021-07-27 12:38:48 -07:00
Philippe Tillet
7924642b78 [PYTHON][EXAMPLES][EINSUM] Added stride in CONV2D example 2021-07-27 12:38:48 -07:00
Philippe Tillet
f22ad0064c [PYTHON][EXAMPLES][EINSUM] Added group-convolution test/benchmark 2021-07-27 12:38:48 -07:00
Philippe Tillet
5bb977173f [PYTHON][EINSUM] re-established auto-tuning 2021-07-27 12:38:48 -07:00
Philippe Tillet
ec2cb2155e [TESTS] Simplified testing of half-precision transposes 2021-07-27 12:38:48 -07:00
Philippe Tillet
4ae0e28b32 [PYTHON][KERNEL] Added thread-safety when caching custom torch op 2021-07-27 12:38:48 -07:00
Philippe Tillet
677ccfb44e [CORE][RUNTIME] Better error message on internal compilation error 2021-07-27 12:38:48 -07:00
Philippe Tillet
94e8ee7f01 [PYTHON][KERNEL] Better handling of case where cache directory already
exists
2021-07-27 12:38:48 -07:00
Philippe Tillet
5943baa53f [GENERAL] Error messages now no longer make terminal color green 2021-07-27 12:38:48 -07:00
Philippe Tillet
3304629de9 [CORE] Fixed several issues that arose in the development of the
torch-blocksparse package:

* Now using warp shuffle in reductions when possible
* Various bugfixes in layout inference
* Added INFINITY, exponential and select
* Better error messages for unimplemented constructs
2021-07-27 12:38:48 -07:00
Phillippe Tillet
ac26fbdc1f Merge pull request #35 from jack-willturner/iss33
[PYTHON] Add empty string to llvm-config versions in setup.py
2021-07-27 12:38:48 -07:00
Philippe Tillet
55c800e632 [DOCS] Fixed typo: triton.function -> torch.autograd.Function 2021-07-27 12:38:48 -07:00
Jack Turner
33d7619482 [PYTHON] Add empty string to llvm-config versions in setup.py 2021-07-27 12:38:48 -07:00
Philippe Tillet
a5e3397e6e [DOCS] Made documentation consistent with the new kernel API 2021-07-27 12:38:48 -07:00
Philippe Tillet
eadaeab299 [CORE][TOOLS][BENCH] Disabled normalization for auto-tuning 2021-07-27 12:38:48 -07:00
Philippe Tillet
9cb3fd899a [CORE][DRIVER] Now only using PTX6.4 if CUDA10.1+ is detected 2021-07-27 12:38:48 -07:00
Philippe Tillet
f08dd0ec58 [CMAKE] target_link_directories -> link_directories 2021-07-27 12:38:48 -07:00
Philippe Tillet
9fda39f64c [PYTHON][EXAMPLES] Removed BlockSparse examples; see
https://github.com/ptillet/torch-blocksparse.git
2021-07-27 12:38:48 -07:00
Philippe Tillet
268894a5ce [PYTHON] Merged blocksparse branch:
* Example for blocksparse matrix multiplication
* Simplified Triton kernel API
* Revived auto-tuning in einsum
2021-07-27 12:38:48 -07:00
Philippe Tillet
ea37ba5d35 [PYTHON][OPS] Fixed typo in einsum 2021-07-27 12:38:48 -07:00
Philippe Tillet
646c49f847 [CMAKE] Fixed issue in LLVM link directory 2021-07-27 12:38:48 -07:00
Philippe Tillet
926acc2e28 [TRITON][NN][CONV] Renamed input -> x to not modify built-in functions 2021-07-27 12:38:48 -07:00
Philippe Tillet
420e36a038 [PYTHON][NN][CONV] Fixed typo in dx computation 2021-07-27 12:38:48 -07:00
Philippe Tillet
01154f24db [PYTHON][SETUP] Removed obsolete debug print() 2021-07-27 12:38:48 -07:00
Philippe Tillet
b3c03e24de [DOCUMENTATION] Updated installation instructions in README.md 2021-07-27 12:38:48 -07:00
Philippe Tillet
3b9672443c [CORE] Added missing sha1.hpp file 2021-07-27 12:38:48 -07:00
Philippe Tillet
ecb0d81b2d [PYTHON] Added missing files for nn submodule 2021-07-27 12:38:48 -07:00
Philippe Tillet
3d769b57e2 [PYTHON] Better packaging 2021-07-27 12:38:48 -07:00
Philippe Tillet
dfb844bf41 [GENERAL] Improved caching mechanism:
* Now computing hash in libtriton
* Now only compiling a single pytorch hook per function signature
2021-07-27 12:38:48 -07:00
Philippe Tillet
30f77e9ec5 [PYTHON][OPS][EINSUM] Now throwing error for automatic differentiation
of extended einsum
2021-07-27 12:38:48 -07:00
Philippe Tillet
4e50ef4076 [PYTHON][OP][EINSUM] simplified API 2021-07-27 12:38:48 -07:00