gaspersic/triton
keren/insert-slice-other-nonzero
triton/lib/Dialect/TritonGPU/IR/CMakeLists.txt

12 lines
154 B
CMake

more progress on TritonGPU
2022-04-28 18:51:31 +08:00
add_mlir_dialect_library(TritonGPUIR
Dialect.cpp
[Triton-MLIR] Replace triton.extract_slice with tensor.extract_slice and support more general tensor slicing (#837)

## Features
- Allow taking a block of tensor slice, as long as each dimension is contiguous (unit stride).
- Fix some problems in `insert_slice_async`'s semantic.
- More general verification for ops that return shared layout encoding.

## Known Limitations
- `insert_slice_async` still uses the old semantic. May submit another PR later to support similar semantic like `tensor.extract_slice`.
- No encoding verification for `tensor.extract_slice`.
- 3d tensor ops are broken.
- Strided accesses are not allowed.
- May cause a little performance slowdown since we are passing strides as values but not constants (e.g., int). It would be difficult to pass strides as attributes when we have control flows. A block argument is possible to accept tensors with different strides.
2022-11-06 22:59:03 -08:00
Traits.cpp
more progress on TritonGPU
2022-04-28 18:51:31 +08:00
DEPENDS
TritonGPUTableGen
TritonGPUAttrDefsIncGen
LINK_LIBS PUBLIC
TritonIR
)
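The blame view interleaves commit annotations with the source, so the build rule is hard to read in one piece. The listing below is a sketch of the same rule with the annotations stripped. The indentation is an assumption, since the page view does not preserve whitespace, and any blank lines in the original file are not shown.

```cmake
# TritonGPU dialect IR library, assembled from the annotated lines above.
# add_mlir_dialect_library is the MLIR-provided CMake helper for dialect libraries.
add_mlir_dialect_library(TritonGPUIR
  # Source files compiled into the library
  Dialect.cpp
  Traits.cpp

  # TableGen targets that must be generated before these sources compile
  DEPENDS
  TritonGPUTableGen
  TritonGPUAttrDefsIncGen

  # Libraries linked publicly, so consumers of TritonGPUIR also link them
  LINK_LIBS PUBLIC
  TritonIR
)
```

Per the blame annotations, `Dialect.cpp` is the only line last touched by the #837 commit; the remaining lines date from the earlier "more progress on TritonGPU" commit.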