triton

Author	SHA1	Message	Date
Philippe Tillet	4e93b41c52	[GENERAL] Some minor fixups (#393 ) * [RUNTIME] Now displaying error message when generated PTX is invalid * [CODEGEN] Now converting `if` condition to bool implicitly	2021-12-17 18:06:21 -08:00
Madeleine Thompson	e575ae3443	[FRONTEND] Minor accumulated style and warning fixes (#388 ) - Fix some whitespace. - Make an undeclared dependency on `pytest` explicit. - Fix deprecated `description-file` use. - `#ifdef` out a deprecated `PyEval_InitThreads` call. - Use a slightly different numpy invocation in `test_random.py` to quiet down overflow warnings in tests. - Fix a deprecated cast in `test_core.py`. - Suppress a warning about `visit_Constant` in Python 3.9+; we can't migrate yet because it'd break Python 3.6 and 3.7. - Use chained exceptions for `CompilationError` rather than rolling our own; it makes the error messages nicer. - Add a `__str__` for `tl.dtype` to make debugging kernels easier; it lets you `print` a dtype to see what type was inferred. - Fix a few bad escapes.	2021-12-10 15:19:20 -08:00
Philippe Tillet	9def2424ab	[RUNTIME] Fix typo in IfExp	2021-12-09 15:14:41 -08:00
Philippe Tillet	e31b9b4e66	[RUNTIME] Better support for `None` (#387 ) * regression test fails but it doesn't make sense to me.	2021-12-09 13:21:22 -08:00
Philippe Tillet	c86ad9c9ab	[FRONTEND] Added default arguments to non-kernel @triton.jit'd function (#379 )	2021-11-29 19:11:26 -08:00
daadaada	1296eb877b	[RUNTIME] Config hook v2.0 (#373 ) * Add pre_hook to triton.Config * Use argument names in triton.heuristics * Update base perf * Remove meta from heuristics	2021-11-21 11:20:59 -08:00
Philippe Tillet	5693b582ea	[RUNTIME] Now using pybind11 to avoid memory leaks (#377 )	2021-11-21 02:30:22 -08:00
Philippe Tillet	edd4b0c8b7	[CODEGEN] Fixed issue with jit function passed as constexpr	2021-11-16 09:53:34 -08:00
Philippe Tillet	5b7ba3eb96	[CODEGEN] Reverted to old launch method (memory leak?)	2021-11-16 01:21:03 -08:00
Philippe Tillet	791b953b21	[CODEGEN] Reverted to old way to query current stream	2021-11-16 00:17:27 -08:00
Philippe Tillet	01cc3d4503	[RUNTIME] Restored `do_not_specialize` (#374 )	2021-11-12 15:06:55 -08:00
Philippe Tillet	e66bf76354	[RUNTIME] Bunch of bugfixes (#372 )	2021-11-12 00:55:00 -08:00
Philippe Tillet	f7ab96cfd7	[FRONTEND] Fixed some issues with `constexpr`	2021-11-09 13:03:09 -08:00
Philippe Tillet	5d54352164	[FRONTEND] Significantly reduce kernel launch time (#367 )	2021-11-04 13:25:24 -07:00
Philippe Tillet	2acaa4d0dd	[LANG] Added support for constexpr (#361 )	2021-10-30 00:32:58 -07:00
daadaada	858dec8372	[CODEGEN] Add cache modifier to tl.load (#351 ) * Add cache modifier to tl.load * Add comment to cache_modifier * Remove force_nc_cache * Update test	2021-10-17 22:14:04 -07:00
Philippe Tillet	bfacc191b3	[FRONTEND] Now cache re-compiles when `language` changes (#348 )	2021-10-13 12:29:57 -07:00
Philippe Tillet	5123db0b7d	[LANG] Various (relatively minor) improvements (#320 )	2021-10-04 18:39:40 -07:00
Philippe Tillet	bfcfad7abe	[FRONTEND] Disable P2P (#312 )	2021-09-27 21:18:27 -07:00
Philippe Tillet	c3756d1c33	[FRONTEND] Add `do_not_specialize` to triton.jit to prevent specialization of kernel argument (#309 )	2021-09-24 20:27:10 -07:00
Philippe Tillet	83da3febf2	[FRONTEND] Added simple hook for when something is written to the cache (#308 )	2021-09-23 22:23:17 -07:00
Shantanu	0735061fce	[FRONTEND] fix for unpickleable keys (#307 ) In #306, I added the key to the cache data, so we can introspect to investigate cache misses. Unfortunately, the key isn't pickleable, so just add the str version instead. Co-authored-by: hauntsaninja <>	2021-09-23 21:23:59 -07:00
Shantanu	2066ccd87e	[FRONTEND] single file caches (#306 ) Co-authored-by: hauntsaninja <>	2021-09-23 20:21:19 -07:00
Shantanu	d253eb8719	[FRONTEND] Add cache_version to triton.jit (#301 )	2021-09-23 16:45:54 -07:00
Philippe Tillet	5211f23a63	[FRONTEND] updated TensorWrapper (#299 )	2021-09-22 13:53:27 -07:00
Philippe Tillet	41dbaf3b3f	[FRONTEND] Fixed typo in cache for .dumb db (#296 )	2021-09-21 17:03:41 -07:00
Philippe Tillet	c151e0f6aa	[FRONTEND] Simplified detection of corrupted cache (#295 )	2021-09-21 16:36:24 -07:00
Philippe Tillet	e96edc16ff	[FRONTEND] Compute cache now supports atomic writes (#294 ) Note that killing a Triton process while it updates the cache will result in the cache being wiped out. This is because copying a whole `db` to a temporary file can be quite expensive on some systems.	2021-09-21 14:10:02 -07:00
Philippe Tillet	a12827848d	[FRONTEND] Now using exist_ok=True when creating cache directories (#288 )	2021-09-18 23:44:21 -07:00
Philippe Tillet	6e5b0b4301	[FRONTEND] Added on-disk cache for compiled kernels (#287 )	2021-09-18 22:48:26 -07:00
Philippe Tillet	ac10551d55	[PYTHON] Now providing triton.next_power_of_2 (#273 )	2021-09-10 11:05:44 -07:00
Philippe Tillet	94c83d30ce	[GENERAL] Removed deprecated driver files and added basic compatibility with rocm (#268 ) - Removed driver module -- accelerator runtime is handled by pytorch - Added basic support for ROCM based on @micmelesse 's PR -- now can execute empty kernel on AMD devices without any compile-time changes - Now only using PREFER_SHARED for kernels when the size of shared memory is greater than 49k. Otherwise there can be poor L1 performance for broadcast tensors	2021-09-09 00:04:28 -07:00
Philippe Tillet	c069ef907e	[PYTHON] `triton.language` is now a submodule rather than a single file (#260 )	2021-09-02 13:30:14 -07:00
Rohit Dwivedula	c0daffc625	[DOCS] @heuristics -> @triton.heuristics in some snippets (#253 )	2021-09-01 18:50:17 -07:00
milesial	5b29da719d	[DRIVER] Add CUDA P2P support (#209 )	2021-08-20 21:00:54 -07:00
Philippe Tillet	f26a48a3b4	[DOCS] Various improvements (#224 ) - Added docstr for autotune, Config, heuristics - Added docstr for atomics - Hiding internal _builder argument used for built-in language primitives - Re-factor docstr to use common templates between similar functions.	2021-08-18 11:15:53 -07:00
Philippe Tillet	6e7593b446	added reset_to_zero in vector addition (#205 )	2021-08-14 10:58:38 -07:00
Philippe Tillet	c7a272cb91	[FRONTEND] Added default arguments for `range` (#203 )	2021-08-14 10:11:18 -07:00
Philippe Tillet	a34c57402f	[PYTHON] Improved error message for CPU (#167 )	2021-07-30 09:47:27 -07:00
Philippe Tillet	298aead378	[FRONTEND] Fixed bugs in global symbols resolution of @triton.jit'd functions (#136 )	2021-07-27 12:38:49 -07:00
Philippe Tillet	01276b5153	[FRONTEND] Added compilation flag to force use of `.nc` cache modifier (#134 ) in DRAM loads. /!\ USE CAREFULLY - THIS CAN BREAK CORRECTNESS IF MISUSED /!\	2021-07-27 12:38:49 -07:00
Philippe Tillet	8cea583109	[IR] Preliminary support for BF16 (#129 ) This PR adds a BF16 data-type, along with FP32 <-> BF16 conversion instructions in the LLVM codegen. Other kinds of ops on bfloat16 are not yet supported.	2021-07-27 12:38:49 -07:00
Philippe Tillet	9b4e2cae2d	[PYTHON] Added `isinstance` to list of builtin symbols exposed (#128 )	2021-07-27 12:38:49 -07:00
daadaada	d8d6b715c8	[CODEGEN] Performance improvement on A100 (#125 ) Improved codegen for the Ampere GPUs. * Make the layout pass recognize the multistage pipelined pattern. * Now the pipeline pass can automate the multistage pipelining transformation. * Remove extra barriers (from the prefetch pass & WAR) on Ampere. * Update the code generator (generator.cc) to make Triton generate n-buffered shared memory loads/stores.	2021-07-27 12:38:49 -07:00
Philippe Tillet	3ab121dbdb	[PYTHON] Added support for tuples (#116 )	2021-07-27 12:38:49 -07:00
Philippe Tillet	325ee38581	[PYTHON] Fixed bug in scoping mechanism (#111 ) Inline functions didn't restore scope of parents. Also some control flow structure still had the scoping semantics of C++	2021-07-27 12:38:49 -07:00
Philippe Tillet	288b4f7f58	[PYTHON] Added frontend to print sass using turingas disasm.py (#109 )	2021-07-27 12:38:49 -07:00
Szymon Sidor	ac57812bdc	[PYTHON] Update TensorWrapper with device attribute (#102 ) bugfix `triton.reinterpret`	2021-07-27 12:38:49 -07:00
Philippe Tillet	7355efa745	[LANG] Preliminary FP8 support (#96 )	2021-07-27 12:38:49 -07:00
Philippe Tillet	4290be1ae8	[PYTHON] Various minor codegen fixes (#95 )	2021-07-27 12:38:49 -07:00

1 2 3

106 Commits