[Triton-MLIR] Replace triton.extract_slice with tensor.extract_slice and support more general tensor slicing (#837)

## Features

- Allow slicing a block out of a tensor, as long as each dimension is
contiguous (unit stride).
- Fix some problems in the semantics of `insert_slice_async`.
- More general verification for ops that return a shared layout encoding.
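
As a rough illustration of the block-slicing rule above, here is a minimal plain-Python model. It is not Triton or MLIR code: `extract_slice_2d` is a hypothetical helper, and the only thing borrowed from `tensor.extract_slice` is the offsets/sizes/strides triple. The sketch assumes a 2D tensor stored as a list of rows and rejects any non-unit stride.

```python
def extract_slice_2d(tensor, offsets, sizes, strides):
    """Return the block tensor[r0:r0+rows, c0:c0+cols] as a new list of rows.

    Hypothetical model of the slicing rule: every stride must be 1.
    """
    if not (len(offsets) == len(sizes) == len(strides) == 2):
        raise ValueError("offsets, sizes and strides must each have 2 entries")
    if any(stride != 1 for stride in strides):
        raise ValueError("strided slices are not supported; every stride must be 1")

    row0, col0 = offsets
    rows, cols = sizes
    num_rows = len(tensor)
    num_cols = len(tensor[0]) if tensor else 0
    if row0 < 0 or col0 < 0 or row0 + rows > num_rows or col0 + cols > num_cols:
        raise ValueError("slice is out of bounds")

    # Copy the contiguous block row by row.
    return [list(row[col0:col0 + cols]) for row in tensor[row0:row0 + rows]]


# A 4x4 tensor whose element at (r, c) is r * 4 + c.
tensor = [[r * 4 + c for c in range(4)] for r in range(4)]
block = extract_slice_2d(tensor, offsets=(1, 2), sizes=(2, 2), strides=(1, 1))
```

Passing `strides=(1, 2)` to the same helper raises `ValueError`, which mirrors the "strided accesses are not allowed" limitation listed below.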

## Known Limitations

- `insert_slice_async` still uses the old semantics. A follow-up PR may
add semantics similar to `tensor.extract_slice`.
- No encoding verification for `tensor.extract_slice`.
- 3D tensor ops are broken.
- Strided accesses are not allowed.
- May cause a slight performance slowdown, since strides are passed as
values rather than constants (e.g., int).
It would be difficult to pass strides as attributes in the presence of
control flow, because a block argument may accept tensors with different
strides.
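
The last limitation can be pictured with a small plain-Python sketch. This is only an illustration of the reasoning, not Triton's code generation: `linear_offset`, `load_element`, and `pick_tensor` are hypothetical names, and `pick_tensor` stands in for a block argument that may receive either of two tensors. Because the two layouts have different strides, the consumer has to take the strides as runtime values rather than as constants baked in at one site.

```python
def linear_offset(indices, strides):
    """Flat buffer offset of a multi-dimensional index for the given strides."""
    return sum(i * s for i, s in zip(indices, strides))


def load_element(buffer, indices, strides):
    """Read one element; the strides arrive as ordinary runtime values."""
    return buffer[linear_offset(indices, strides)]


# The same logical 2x3 tensor [[0, 1, 2], [3, 4, 5]] in two layouts.
row_major = ([0, 1, 2, 3, 4, 5], (3, 1))  # (buffer, strides)
col_major = ([0, 3, 1, 4, 2, 5], (1, 2))


def pick_tensor(use_row_major):
    """Models a block argument: either layout may flow in from a predecessor."""
    return row_major if use_row_major else col_major


# The consumer cannot assume one fixed stride tuple, so it reads them at run time.
results = []
for flag in (True, False):
    buffer, strides = pick_tensor(flag)
    results.append(load_element(buffer, (1, 2), strides))
```

Both iterations read the logical element at index (1, 2), even though the flat offsets differ between the two layouts.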

Author: Keren Zhou
Date: 2022-11-06 22:59:03 -08:00
Committed by: GitHub
Parent: a4ff0c362c
Commit: fdd59900f7

26 changed files with 507 additions and 339 deletions

									
lib/Dialect/Triton/IR/Traits.cpp

@@ -66,4 +66,4 @@ mlir::LogicalResult mlir::OpTrait::impl::verifyTensorSize(Operation *op) {
     }
   }
   return success();
-}
+}
