Philippe Tillet
f805ff278a
[PYTHON][SRC][BINDING] Improved code portability across compilers
2021-07-27 12:38:48 -07:00
Philippe Tillet
c36ad6bf8a
[PYTHON][EXAMPLES][EINSUM] Updated configs for matmul
2021-07-27 12:38:48 -07:00
Philippe Tillet
7924642b78
[PYTHON][EXAMPLES][EINSUM] Added stride in CONV2D example
2021-07-27 12:38:48 -07:00
Philippe Tillet
f22ad0064c
[PYTHON][EXAMPLES][EINSUM] Added group-convolution test/benchmark
2021-07-27 12:38:48 -07:00
Philippe Tillet
5bb977173f
[PYTHON][EINSUM] re-established auto-tuning
2021-07-27 12:38:48 -07:00
Philippe Tillet
ec2cb2155e
[TESTS] Simplified testing of half-precision transposes
2021-07-27 12:38:48 -07:00
Philippe Tillet
4ae0e28b32
[PYTHON][KERNEL] Added thread-safety when caching custom torch op
2021-07-27 12:38:48 -07:00
Philippe Tillet
677ccfb44e
[CORE][RUNTIME] Better error message on internal compilation error
2021-07-27 12:38:48 -07:00
Philippe Tillet
94e8ee7f01
[PYTHON][KERNEL] Better handling of case where cache directory already
...
exists
2021-07-27 12:38:48 -07:00
Philippe Tillet
5943baa53f
[GENERAL] Error messages now no longer make terminal color green
2021-07-27 12:38:48 -07:00
Philippe Tillet
3304629de9
[CORE] Fixed several issues that arose in the development of the
...
torch-blocksparse package:
* Now using warp shuffle in reductions when possible
* Various bugfixes in layout inference
* Added INFINITY, exponential and select
* Better error messages for unimplemented constructs
2021-07-27 12:38:48 -07:00
Phillippe Tillet
ac26fbdc1f
Merge pull request #35 from jack-willturner/iss33
...
[PYTHON] Add empty string to llvm-config versions in setup.py
2021-07-27 12:38:48 -07:00
Philippe Tillet
55c800e632
[DOCS] Fixed typo: triton.function -> torch.autograd.Function
2021-07-27 12:38:48 -07:00
Jack Turner
33d7619482
[PYTHON] Add empty string to llvm-config versions in setup.py
2021-07-27 12:38:48 -07:00
Philippe Tillet
a5e3397e6e
[DOCS] Made documentation consistent with the new kernel API
2021-07-27 12:38:48 -07:00
Philippe Tillet
eadaeab299
[CORE][TOOLS][BENCH] Disabled normalization for auto-tuning
2021-07-27 12:38:48 -07:00
Philippe Tillet
9cb3fd899a
[CORE][DRIVER] Now only using PTX6.4 if CUDA10.1+ is detected
2021-07-27 12:38:48 -07:00
Philippe Tillet
f08dd0ec58
[CMAKE] target_link_directories -> link_directories
2021-07-27 12:38:48 -07:00
Philippe Tillet
9fda39f64c
[PYTHON][EXAMPLES] Removed BlockSparse examples; see
...
https://github.com/ptillet/torch-blocksparse.git
2021-07-27 12:38:48 -07:00
Philippe Tillet
268894a5ce
[PYTHON] Merged blocksparse branch:
...
* Example for blocksparse matrix multiplication
* Simplified Triton kernel API
* Revived auto-tuning in einsum
2021-07-27 12:38:48 -07:00
Philippe Tillet
ea37ba5d35
[PYTHON][OPS] Fixed typo in einsum
2021-07-27 12:38:48 -07:00
Philippe Tillet
646c49f847
[CMAKE] Fixed issue in LLVM link directory
2021-07-27 12:38:48 -07:00
Philippe Tillet
926acc2e28
[TRITON][NN][CONV] Renamed input -> x to not modify built-in functions
2021-07-27 12:38:48 -07:00
Philippe Tillet
420e36a038
[PYTHON][NN][CONV] Fixed typo in dx computation
2021-07-27 12:38:48 -07:00
Philippe Tillet
01154f24db
[PYTHON][SETUP] Removed obsolete debug print()
2021-07-27 12:38:48 -07:00
Philippe Tillet
b3c03e24de
[DOCUMENTATION] Updated installation instructions in README.md
2021-07-27 12:38:48 -07:00
Philippe Tillet
3b9672443c
[CORE] Added missing sha1.hpp file
2021-07-27 12:38:48 -07:00
Philippe Tillet
ecb0d81b2d
[PYTHON] Added missing files for nn submodule
2021-07-27 12:38:48 -07:00
Philippe Tillet
3d769b57e2
[PYTHON] Better packaging
2021-07-27 12:38:48 -07:00
Philippe Tillet
dfb844bf41
[GENERAL] Improved caching mechanism:
...
* Now computing hash in libtriton
* Now only compiling a single pytorch hook per function signature
2021-07-27 12:38:48 -07:00
Philippe Tillet
30f77e9ec5
[PYTHON][OPS][EINSUM] Now throwing error for automatic differentiation
...
of extended einsum
2021-07-27 12:38:48 -07:00
Philippe Tillet
4e50ef4076
[PYTHON][OP][EINSUM] simplified API
2021-07-27 12:38:48 -07:00
Philippe Tillet
26fd884d96
[PYTHON][OPS][EINSUM] Added support for inner tensor strides
2021-07-27 12:38:48 -07:00
Philippe Tillet
4181f9f2af
[CODEGEN][TRANSFORM][PEEPHOLE] Fixed bug in *1 multiplication
2021-07-27 12:38:48 -07:00
Philippe Tillet
9e54a03006
[PYTHON][EXAMPLES] Removed obsolete files
2021-07-27 12:38:48 -07:00
Philippe Tillet
3816f2f259
[PYTHON][EINSUM] Now handling reduction sizes that are not a multiple of
...
TK
2021-07-27 12:38:48 -07:00
Philippe Tillet
fa4ec7ea65
[PYTHON][OPS][EINSUM] Added support for masked accumulator
2021-07-27 12:38:48 -07:00
Philippe Tillet
69326ae6de
[DOCS][TUTORIALS] Fixed typo
2021-07-27 12:38:48 -07:00
Philippe Tillet
404dd18333
[PYTHON][CORE] Deprecating Tensorflow support
2021-07-27 12:38:48 -07:00
Philippe Tillet
d7a781dd40
[DOCS] Fixed README.md
2021-07-27 12:38:48 -07:00
Philippe Tillet
97293440fc
[DOC] Fixed formatting issues in tutorial
2021-07-27 12:38:48 -07:00
Philippe Tillet
5778eea546
[DOC] Added more tutorials
2021-07-27 12:38:48 -07:00
Philippe Tillet
12a621bb2f
[DOC] Basic Sphynx skeleton
2021-07-27 12:38:48 -07:00
Philippe Tillet
d22cf4f717
[TRITON][LANG] Added support for bitcast
2021-07-27 12:38:48 -07:00
Philippe Tillet
7a40077bfd
[README] Added comment about ISAAC
2021-07-27 12:38:48 -07:00
Philippe Tillet
327ae489df
[GENERAL] Updated README.md
2021-07-27 12:38:48 -07:00
Philippe Tillet
558422c18a
[PYTHON][EXAMPLES] Changed shape of einsum examples
2021-07-27 12:38:48 -07:00
Philippe Tillet
6d7cf35123
History prior to this date belonged to the now deprecated ISAAC project, and was deleted to save space
2021-07-27 12:38:38 -07:00