Philippe Tillet
ddd89e1b22
[GENERAL] Fixed some undefined behavior with GCC-9
2021-07-27 12:38:48 -07:00
Philippe Tillet
0516ea96d0
[CODEGEN] Fixed bug that caused missing recoalescing for some transpose
...
operations
2021-07-27 12:38:48 -07:00
Philippe Tillet
0c5bd7563a
[README] Improved wording
2021-07-27 12:38:48 -07:00
Philippe Tillet
f35b9100e2
[PYTHON] Restored compatibility with powerpc
2021-07-27 12:38:48 -07:00
Philippe Tillet
1426b103e9
[PYTHON] Removed -std=gnu++11 in extra_cflags
2021-07-27 12:38:48 -07:00
Philippe Tillet
04a9ea060b
[GENERAL] Added compatibility with pytorch 1.2.0 and powerpc
2021-07-27 12:38:48 -07:00
Philippe Tillet
9984ee8c7a
[DOCS] Added pip command in README.md
2021-07-27 12:38:48 -07:00
Philippe Tillet
32d615f8f8
[DOCS] Now specifying pip command in installation.rst
2021-07-27 12:38:48 -07:00
Phillippe Tillet
ab75fbccc0
Merge pull request #38 from jack-willturner/master
...
Add working examples to tutorials and python examples folder
2021-07-27 12:38:48 -07:00
Philippe Tillet
609ef3a24d
[CORE] Fixed bug for Multi-GPU
2021-07-27 12:38:48 -07:00
jack-willturner
180ed26b61
[DOCS] Transposition fix
2021-07-27 12:38:48 -07:00
Philippe Tillet
24586e60aa
[PACKAGING] sdist now generates working .tar.gz file
2021-07-27 12:38:48 -07:00
jack-willturner
0920da6fae
Merge https://github.com/ptillet/triton
2021-07-27 12:38:48 -07:00
Philippe Tillet
769c1180c5
[PACKAGING] Fixed import error
2021-07-27 12:38:48 -07:00
jack-willturner
a98a2db2c2
[DOCS] Matrix copy and transpose
2021-07-27 12:38:48 -07:00
Philippe Tillet
435acbf585
[PACKAGING] Added MANIFEST.in and some symlinks for better packaging
2021-07-27 12:38:48 -07:00
jack-willturner
32819dea51
[DOCS] Matmul and vecadd working examples
2021-07-27 12:38:48 -07:00
Philippe Tillet
ce4a4728f5
[PACKAGING] Fixed typo in setup.py
2021-07-27 12:38:48 -07:00
Philippe Tillet
3709f564e1
[PACKAGING] Added some more files for packaging
2021-07-27 12:38:48 -07:00
Philippe Tillet
c73dee080c
[CODEGEN] Fixed bug for phi nodes with constant incoming value
2021-07-27 12:38:48 -07:00
Philippe Tillet
54805596f5
[CODEGEN][ANALYSIS] bugfix in alignment analysis
2021-07-27 12:38:48 -07:00
Philippe Tillet
f805ff278a
[PYTHON][SRC][BINDING] Improved code portability across compilers
2021-07-27 12:38:48 -07:00
Philippe Tillet
c36ad6bf8a
[PYTHON][EXAMPLES][EINSUM] Updated configs for matmul
2021-07-27 12:38:48 -07:00
Philippe Tillet
7924642b78
[PYTHON][EXAMPLES][EINSUM] Added stride in CONV2D example
2021-07-27 12:38:48 -07:00
Philippe Tillet
f22ad0064c
[PYTHON][EXAMPLES][EINSUM] Added group-convolution test/benchmark
2021-07-27 12:38:48 -07:00
Philippe Tillet
5bb977173f
[PYTHON][EINSUM] re-established auto-tuning
2021-07-27 12:38:48 -07:00
Philippe Tillet
ec2cb2155e
[TESTS] Simplified testing of half-precision transposes
2021-07-27 12:38:48 -07:00
Philippe Tillet
4ae0e28b32
[PYTHON][KERNEL] Added thread-safety when caching custom torch op
2021-07-27 12:38:48 -07:00
Philippe Tillet
677ccfb44e
[CORE][RUNTIME] Better error message on internal compilation error
2021-07-27 12:38:48 -07:00
Philippe Tillet
94e8ee7f01
[PYTHON][KERNEL] Better handling of case where cache directory already
...
exists
2021-07-27 12:38:48 -07:00
Philippe Tillet
5943baa53f
[GENERAL] Error messages now no longer make terminal color green
2021-07-27 12:38:48 -07:00
Philippe Tillet
3304629de9
[CORE] Fixed several issues that arose in the development of the
...
torch-blocksparse package:
* Now using warp shuffle in reductions when possible
* Various bugfixes in layout inference
* Added INFINITY, exponential and select
* Better error messages for unimplemented constructs
2021-07-27 12:38:48 -07:00
Phillippe Tillet
ac26fbdc1f
Merge pull request #35 from jack-willturner/iss33
...
[PYTHON] Add empty string to llvm-config versions in setup.py
2021-07-27 12:38:48 -07:00
Philippe Tillet
55c800e632
[DOCS] Fixed typo: triton.function -> torch.autograd.Function
2021-07-27 12:38:48 -07:00
Jack Turner
33d7619482
[PYTHON] Add empty string to llvm-config versions in setup.py
2021-07-27 12:38:48 -07:00
Philippe Tillet
a5e3397e6e
[DOCS] Made documentation consistent with the new kernel API
2021-07-27 12:38:48 -07:00
Philippe Tillet
eadaeab299
[CORE][TOOLS][BENCH] Disabled normalization for auto-tuning
2021-07-27 12:38:48 -07:00
Philippe Tillet
9cb3fd899a
[CORE][DRIVER] Now only using PTX6.4 if CUDA10.1+ is detected
2021-07-27 12:38:48 -07:00
Philippe Tillet
f08dd0ec58
[CMAKE] target_link_directories -> link_directories
2021-07-27 12:38:48 -07:00
Philippe Tillet
9fda39f64c
[PYTHON][EXAMPLES] Removed BlockSparse examples; see
...
https://github.com/ptillet/torch-blocksparse.git
2021-07-27 12:38:48 -07:00
Philippe Tillet
268894a5ce
[PYTHON] Merged blocksparse branch:
...
* Example for blocksparse matrix multiplication
* Simplified Triton kernel API
* Revived auto-tuning in einsum
2021-07-27 12:38:48 -07:00
Philippe Tillet
ea37ba5d35
[PYTHON][OPS] Fixed typo in einsum
2021-07-27 12:38:48 -07:00
Philippe Tillet
646c49f847
[CMAKE] Fixed issue in LLVM link directory
2021-07-27 12:38:48 -07:00
Philippe Tillet
926acc2e28
[TRITON][NN][CONV] Renamed input -> x to not modify built-in functions
2021-07-27 12:38:48 -07:00
Philippe Tillet
420e36a038
[PYTHON][NN][CONV] Fixed typo in dx computation
2021-07-27 12:38:48 -07:00
Philippe Tillet
01154f24db
[PYTHON][SETUP] Removed obsolete debug print()
2021-07-27 12:38:48 -07:00
Philippe Tillet
b3c03e24de
[DOCUMENTATION] Updated installation instructions in README.md
2021-07-27 12:38:48 -07:00
Philippe Tillet
3b9672443c
[CORE] Added missing sha1.hpp file
2021-07-27 12:38:48 -07:00
Philippe Tillet
ecb0d81b2d
[PYTHON] Added missing files for nn submodule
2021-07-27 12:38:48 -07:00
Philippe Tillet
3d769b57e2
[PYTHON] Better packaging
2021-07-27 12:38:48 -07:00