Commit Graph

  • c529b462f5 more fixes on pipeline.cpp Yan Da 2022-05-26 13:14:41 +08:00
  • 0e2883020a [BACKEND] Fixed typo in alignment analysis (#528) Philippe Tillet 2022-05-25 20:01:19 -07:00
  • 43fec2adca [FRONTEND] Add binding for create_int_to_ptr (#526) Bert Maher 2022-05-25 18:26:18 -04:00
  • 71d1c10e19 Remove weird includes Yan Da 2022-05-25 21:54:06 +08:00
  • 9308e9c90c A more general pipeliner Yan Da 2022-05-25 21:52:51 +08:00
  • 441fd7c3cc assembly format Yan Da 2022-05-25 17:53:24 +08:00
  • e6f89a5777 Fix ReduceOp conversion Yan Da 2022-05-25 16:03:06 +08:00
  • 9b670cfb9f Add ReduceOp Yan Da 2022-05-25 14:15:36 +08:00
  • 011bc83c1b [FRONTEND] For loops now promote initial value (#524) Philippe Tillet 2022-05-24 13:20:10 -07:00
  • 96bff90471 [FRONTEND] faster jit function launch (#523) Natalia Gimelshein 2022-05-24 12:08:49 -07:00
  • d5eaa8dfa0 Making the generated Triton IR deterministic & a script to compare cached assembly (#522) daadaada 2022-05-24 23:56:36 +08:00
  • a2c9f919a8 TritonGPU verifier Yan Da 2022-05-24 19:48:56 +08:00
  • 80f6a2698b [FRONTEND] Ensure version_key is called at most once (#519) Shantanu 2022-05-23 13:40:08 -07:00
  • 36c45ec687 make numStages an option in PipelinePass Yan Da 2022-05-23 12:47:55 +08:00
  • 39b1235082 fix atomic_cas Yan Da 2022-05-22 19:43:04 +08:00
  • 205a493b10 [FRONTEND] Fix a bug in atomic_cas (correct cmp to val) & more tests on atomic_cas (#520) daadaada 2022-05-22 00:45:54 +08:00
  • 79298d61bc fix a pipeline issue Yan Da 2022-05-16 19:38:40 +08:00
  • c3c4ac3733 TritonGPU combiner Yan Da 2022-05-16 19:17:15 +08:00
  • e3916c3a46 TritonGPU combiner Yan Da 2022-05-16 19:16:01 +08:00
  • 0e68e6eb59 delete erroneous include Yan Da 2022-05-15 22:30:26 +08:00
  • 7027af9666 The pipeline pass is now functional Yan Da 2022-05-15 22:29:27 +08:00
  • abea3dc2c6 [FRONTEND] provide device kwargs && fix fstring error for py<3.8 (#515) Jiabao Lei 2022-05-15 07:21:46 +08:00
  • d35617bea1 [BACKEND][CODEGEN] Faster reduction for scanline layout (#516) Philippe Tillet 2022-05-14 15:26:13 -07:00
  • 7e0e7ec365 more progress on the pipeline pass Yan Da 2022-05-14 22:04:36 +08:00
  • d1a22a94e6 [FRONTEND] Add empty return value and remove protect to open the access to contained_tys_vec_t (#514) Mengchi Zhang 2022-05-13 11:46:12 -07:00
  • 978463ba39 more progress on the pipeline pass Yan Da 2022-05-13 21:32:35 +08:00
  • d954a05989 [FRONTEND] Handle torch.uint8 args (#513) Jason Ansel 2022-05-12 13:07:39 -07:00
  • 0835a4fb05 [TUTORIALS] Removed #noformat in layer norm tutorial Philippe Tillet 2022-05-12 12:41:25 -07:00
  • c736ba7c3e [TUTORIALS] Fixed formatting Philippe Tillet 2022-05-12 12:30:36 -07:00
  • cd30a99aa2 [TUTORIALS] fixed formatting Philippe Tillet 2022-05-12 12:28:15 -07:00
  • d23d7b244c More on the pipeline pass Yan Da 2022-05-11 20:31:08 +08:00
  • 1a4fbed25b Skeleton for the pipeline pass Yan Da 2022-05-11 16:13:53 +08:00
  • 96876a46d1 More progress on Triton=>TritonGPU conversion (works for matmul) Yan Da 2022-05-09 21:19:53 +08:00
  • d87435e536 [TUTORIALS] Layer norm tutorial now uses residency control (#510) Philippe Tillet 2022-05-05 19:53:54 -07:00
  • ebf28cdc4b [GH-PAGES] Updated website Philippe Tillet 2022-05-06 00:44:25 +00:00
  • 0c5319eed9 More progress on SCF type conversion Yan Da 2022-05-05 20:56:55 +08:00
  • 7c9bc5a47b [CODEGEN] Change return type of generator::packed_type to appease build warnings (#507) Sriram Murali 2022-05-04 20:03:37 -07:00
  • 19398e6d8a [GH-PAGES] Updated website Philippe Tillet 2022-05-05 00:44:03 +00:00
  • 26c59e4718 More on SCF conversion Yan Da 2022-05-04 21:50:32 +08:00
  • a96fe07e1c DotOp conversion Yan Da 2022-05-04 15:56:24 +08:00
  • 2d281cbc0a ConstantOp conversion pattern Yan Da 2022-05-04 15:35:43 +08:00
  • b9279d2e3b More progress on TritonGPU conversion Yan Da 2022-05-04 14:54:31 +08:00
  • 3ad7bee35e More conversion patterns Yan Da 2022-05-04 12:50:02 +08:00
  • d420763e0b [GH-PAGES] Updated website Philippe Tillet 2022-05-04 00:43:20 +00:00
  • af77440e1b [GH-PAGES] Updated website Philippe Tillet 2022-05-03 00:45:18 +00:00
  • 5f08e2fdae More arith patterns Yan Da 2022-05-02 22:31:29 +08:00
  • 75d32e2442 More on TritonGPU conversion Yan Da 2022-05-02 21:51:00 +08:00
  • 567aa8d4fc [GH-PAGES] Updated website Philippe Tillet 2022-05-02 00:42:39 +00:00
  • 1428185c9c More progress on TritonGPUTypeConverter & TritonGPUConversionTarget Yan Da 2022-05-01 22:06:54 +08:00
  • 4ece9fd1f3 Move dependentDialects from .cpp to .td Yan Da 2022-05-01 13:06:51 +08:00
  • d9017f8593 add basic template for legalizing arithmetic op Phil Tillet 2022-04-30 20:42:25 -07:00
  • 0bce260523 [GH-PAGES] Updated website Philippe Tillet 2022-05-01 00:49:06 +00:00
  • 2c6a213131 [TRITONGPU] Added template for Triton -> TritonGPU conversion Phil Tillet 2022-04-30 14:31:18 -07:00
  • 95feb10ec9 [FRONTEND] fixup (#505) Philippe Tillet 2022-04-30 14:25:06 -07:00
  • e15e7e5ae2 [GH-PAGES] Updated website Philippe Tillet 2022-04-30 00:47:08 +00:00
  • 11a908655d [FRONTEND] Fixup Philippe Tillet 2022-04-29 14:35:09 -07:00
  • cd78ce4888 [FRONTEND] Improved error message when assigning None to non-constexpr Phil Tillet 2022-04-29 09:17:54 -07:00
  • ea296daf2a [GH-PAGES] Updated website Philippe Tillet 2022-04-29 00:42:56 +00:00
  • 2239ac1998 more progress on TritonGPU Yan Da 2022-04-28 18:51:31 +08:00
  • e7903f3c77 [GH-PAGES] Updated website Philippe Tillet 2022-04-28 00:46:19 +00:00
  • 012e8c5b2b fixup Philippe Tillet 2022-04-27 16:39:27 -07:00
  • 513bcaee50 Added some ASCII art for encoding documentation Philippe Tillet 2022-04-27 16:28:27 -07:00
  • 29859605ee Remove unused files Yan Da 2022-04-27 21:20:07 +08:00
  • 38d13ae618 Some progress on TritonGPU Yan Da 2022-04-27 21:16:45 +08:00
  • edca91bf8f Update traits (NoSideEffect) Yan Da 2022-04-27 19:41:07 +08:00
  • 8dfe78f6cf Add TritonCombineOps Yan Da 2022-04-27 19:28:21 +08:00
  • c70f6b666e Merge previous changes Yan Da 2022-04-27 14:06:55 +08:00
  • 74585fb970 Add Triton CombineOps Yan Da 2022-04-27 13:45:56 +08:00
  • 9bf9b2f3d8 [GH-PAGES] Updated website Philippe Tillet 2022-04-27 00:47:37 +00:00
  • 81001d318c Putting Triton dialect in its own folder Philippe Tillet 2022-04-26 14:38:28 -07:00
  • 62a64ff29b Fixed Python link bug in CMakeLists Philippe Tillet 2022-04-25 11:38:16 -07:00
  • ae2a1ab225 [BACKEND] Alignment pass improvements (#503) Philippe Tillet 2022-04-25 21:16:00 -07:00
  • b0a569b724 [GH-PAGES] Updated website Philippe Tillet 2022-04-26 00:43:32 +00:00
  • 7d544799a0 [BACKEND] Now disabling L2 eviction policy for sm < 80 Philippe Tillet 2022-04-25 09:35:36 -07:00
  • 21613349ac [GH-PAGES] Updated website Philippe Tillet 2022-04-25 00:41:43 +00:00
  • 3ca792043f [TEST] Added test for vectorization Philippe Tillet 2022-04-24 13:32:35 -07:00
  • 1581cf9d79 [GH-PAGES] Updated website Philippe Tillet 2022-04-24 00:44:07 +00:00
  • bda209002e [BACKEND][CODEGEN] vectorization bugfix (#502) Philippe Tillet 2022-04-23 13:18:33 -07:00
  • 31dd4ab60e [GH-PAGES] Updated website Philippe Tillet 2022-04-23 00:44:46 +00:00
  • 0cc3b1129b [BACKEND][CODE_GEN] eviction policies now also apply to L2 (#501) Philippe Tillet 2022-04-21 23:56:01 -07:00
  • 7d6c504e8d [TESTING] Added testing utilities for fixing clock and using cuda-memcheck (#500) Philippe Tillet 2022-04-21 22:40:10 -07:00
  • cb69ba73a9 [GH-PAGES] Updated website Philippe Tillet 2022-04-22 00:44:52 +00:00
  • ab04e47bf2 [GH-PAGES] Updated website Philippe Tillet 2022-04-21 00:45:25 +00:00
  • 8d0e47d73e [GH-PAGES] Updated website Philippe Tillet 2022-04-20 00:43:07 +00:00
  • 245e5ab128 [GH-PAGES] Updated website Philippe Tillet 2022-04-19 00:42:04 +00:00
  • abcd11edba [GH-PAGES] Updated website Philippe Tillet 2022-04-18 00:44:04 +00:00
  • 51e029840e [GH-PAGES] Updated website Philippe Tillet 2022-04-17 00:44:02 +00:00
  • 9b0ee317d9 [GH-PAGES] Updated website Philippe Tillet 2022-04-16 00:44:44 +00:00
  • 9e304cf79d Allow JITFunction to return multiple results Yan Da 2022-04-15 15:38:19 +08:00
  • 1c52bd587d Device function & PassManager Yan Da 2022-04-15 14:41:57 +08:00
  • 073be1d2ee [FRONTEND] check that tensors have power-of-two number of elements (#499) Philippe Tillet 2022-04-14 19:30:02 -07:00
  • 824d060dfb [GH-PAGES] Updated website Philippe Tillet 2022-04-15 00:42:31 +00:00
  • 5c7122004c [TUTORIALS] Tutorial shouldn't expose clock. Just removed it. Philippe Tillet 2022-04-14 17:33:44 -07:00
  • dc4d40faec [FRONTEND] now mangle constexpr float containing "e-" Philippe Tillet 2022-04-14 10:26:48 -07:00
  • 2a53636c23 [GH-PAGES] Updated website Philippe Tillet 2022-04-14 00:44:57 +00:00
  • 25f6689508 [FRONTEND] rename current stream monkey patch (#495) Philippe Tillet 2022-04-13 11:45:55 -07:00
  • 9d65bf62fb [GH-PAGES] Updated website Philippe Tillet 2022-04-13 00:45:31 +00:00
  • 76bfac9f15 [FRONTEND] Improved constexpr handling (#493) Philippe Tillet 2022-04-12 00:02:54 -07:00
  • 44d75cf9bb Bugfix in ptxas path. (#487) apd10 2022-03-30 22:45:41 -05:00
  • 9be2d655a3 [DRIVER] LLVM driver fixup (#482) Philippe Tillet 2022-03-23 00:24:45 -07:00