triton

Author	SHA1	Message	Date
Keren Zhou	3ed36dcb4d	[BACKEND] MMA->DotOperand conversion for chain dot of float32 tensors (#962 ) Co-authored-by: Philippe Tillet <phil@openai.com>	2022-12-08 20:11:51 +00:00
Keren Zhou	83f3b9165b	[FRONTEND][BACKEND] Fix bool and int8 load when the other operand is given (#968 )	2022-12-08 11:52:18 -08:00
Keren Zhou	71c35bcf9c	[Triton-MLIR][BACKEND] Mark float to integer in Arithmetic Dialect as legal (#963 )	2022-12-08 09:07:01 -08:00
Philippe Tillet	532e10cf87	[FRONTEND][BACKEND] Clean-up transpositions (#953 )	2022-12-06 09:32:13 -08:00
Crutcher Dunnavant	9490252261	[FRONTEND] Support alternative install locations of system libdevice.10.bc (#951 )	2022-12-06 03:41:44 +00:00
Philippe Tillet	8edfe813a5	[FRONTEND][BACKEND] Added `trans` instruction; made flash attention bwd pass work (#943 )	2022-12-03 09:58:24 -08:00
donproc	9def1bcebf	[TRITON-MLIR][FRONTEND]minor fix to run through atomic_cas test (#925 ) Co-authored-by: dongdongl <dongdongl@nvidia.com>	2022-12-01 13:43:26 +00:00
Keren Zhou	35c9ec1103	[Triton-MLIR][Backend] Fix number of warps and threads per warp when matrices are small (#917 )	2022-11-26 12:30:38 -08:00
donproc	f63be0e9b5	[TRITON-MLIR][BACKEND]support atomic_cas (#914 ) 1. support atomics-cas 2. add xchg support in atomic_rmw Co-authored-by: dongdongl <dongdongl@nvidia.com>	2022-11-25 12:02:08 +08:00
donproc	8925c2cd11	[TRITON-MLIR][BACKEND]AtomicRMWOp supports scalar (#903 ) AtomicRMWOp supports scalar Co-authored-by: dongdongl <dongdongl@nvidia.com>	2022-11-23 07:59:09 +00:00
donproc	afaf59b0c9	[TRITON-MLIR][BACKEND] Atomic support mask (#889 ) Co-authored-by: dongdongl <dongdongl@nvidia.com>	2022-11-19 19:57:19 +08:00
donproc	5eee738df7	[Triton-MLIR][FRONTEND] [BACKEND] fix atomics (#879 ) minor fix to backend and frontend of atomics, we can pass 1 test without mask and the shape aligned with CTA size now Co-authored-by: dongdongl <dongdongl@nvidia.com>	2022-11-16 12:25:15 +08:00
Chenggang Zhao	57fd1864a7	[Triton-MLIR] Support FP8 (#864 ) Co-authored-by: Superjomn <yanchunwei@outlook.com>	2022-11-10 15:53:06 +08:00
Philippe Tillet	976cf12af1	[OPTIMIZER] Fixed memory coalescing (#847 )	2022-11-07 06:22:18 -08:00
ben-zhang-609	84ad215268	[Triton-MLIR] Enable libdevice for ptx backend when has external functions. (#848 ) At the phase from ptx to cubin, check whether llvm::Module has external functions. if has, link with libdevice at: https://github.com/openai/triton/blob/triton-mlir/python/triton/language/libdevice.10.bc	2022-11-07 08:01:50 +00:00
Keren Zhou	4218e68d74	[Triton-MLIR] [Frontend] Return a scalar if all input args are scalar (#839 )	2022-11-03 20:27:47 -07:00
ben-zhang-609	5feb6e24f9	[Triton-MLIR]Add ptx vprintf support (#825 ) Not know how to write unit test for this feature. Co-authored-by: Yan Chunwei <yanchunwei@outlook.com>	2022-11-02 16:39:09 +08:00
Philippe Tillet	12d60cb4a3	[BACKEND] Added support for 1D conversion blocked -> slice (#831 )	2022-11-01 13:19:58 -07:00
Qingyi Liu	cdc0ec5077	[Triton-MLIR][Backend] Fix reduce conversion and unit tests for int dtypes (#826 )	2022-11-01 17:42:59 +08:00
Philippe Tillet	cb1b87a688	[FRONTEND] Made test_if/test_default pass (#823 )	2022-10-30 15:32:55 -07:00
Philippe Tillet	e61dc75942	[FRONTEND] Fixed inliner and got more tests to pass (#822 ) This adds a `DialectInlinerInterface` to the Triton dialect. This, along with a few other minor semantic changes, fixes our tests on call instructions. Also added the option to provide use an "LLVM_SYSPATH" environment variable to link against locally build of LLVM; this was useful for debugging this issue.	2022-10-30 14:10:02 -07:00
Philippe Tillet	7dfab26a39	[FRONTEND][BACKEND] Fixed various bugs (#819 ) - Fixed bugs on layout conversions for int1 data (we should use int8 internally for int1 data to prevent llvm from using vec<i1> which has different semantics) - Fixed semantics of some casts to bool in the frontend	2022-10-29 06:34:14 +00:00
Philippe Tillet	ac0f6793cc	[BACKEND] Added support for scalars in LoadOp / StoreOp / ElementwiseOp (#814 ) Also fixed various errors that showed up in `test_core.py`, and added more TODOs for open (hopefully relatively minor) issues	2022-10-28 16:17:55 +08:00
Philippe Tillet	3e6cc6d66c	[FRONTEND] Made more tests pass (#805 )	2022-10-26 17:47:33 -07:00
Philippe Tillet	a2cbe7af91	[FRONTEND] Enhanced support for binary operators (#801 ) Disabled modulo test (due to change in behavior for `frem` in nvptx between llvm-11 and llvm-14) and bfloat16 (will require some work to emulate in software similar to how it's done in `master`)	2022-10-24 19:47:01 -07:00

25 Commits