Philippe Tillet
|
8ec9f037bb
|
[BACKEND/CODE_GEN] Fixed float32 matmul problem (#380)
|
2021-11-30 22:00:56 -08:00 |
|
Philippe Tillet
|
c86ad9c9ab
|
[FRONTEND] Added default arguments to non-kernel @triton.jit'd function (#379)
|
2021-11-29 19:11:26 -08:00 |
|
daadaada
|
1296eb877b
|
[RUNTIME] Config hook v2.0 (#373)
* Add pre_hook to triton.Config
* Use argument names in triton.heuristics
* Update base perf
* Remove meta from heuristics
|
2021-11-21 11:20:59 -08:00 |
|
Philippe Tillet
|
01cc3d4503
|
[RUNTIME] Restored do_not_specialize (#374)
|
2021-11-12 15:06:55 -08:00 |
|
Philippe Tillet
|
e66bf76354
|
[RUNTIME] Bunch of bugfixes (#372)
|
2021-11-12 00:55:00 -08:00 |
|
Philippe Tillet
|
2acaa4d0dd
|
[LANG] Added support for constexpr (#361)
|
2021-10-30 00:32:58 -07:00 |
|
daadaada
|
858dec8372
|
[CODEGEN] Add cache modifier to tl.load (#351)
* Add cache modifier to tl.load
* Add comment to cache_modifier
* Remove force_nc_cache
* Update test
|
2021-10-17 22:14:04 -07:00 |
|
Stephen McGroarty
|
c2e6b90ff1
|
[CODEGEN] Fixes masked load exception (#342)
|
2021-10-13 13:31:52 -07:00 |
|
Philippe Tillet
|
c3c0ff0552
|
[LANGUAGE] Fixed issue with duplicates in large arrays of random uniform numbers (#338)
|
2021-10-10 15:22:34 -07:00 |
|
daadaada
|
9e9d781912
|
[CODEGEN] Pipeline fixup (#336)
|
2021-10-10 01:47:11 -07:00 |
|
Philippe Tillet
|
5123db0b7d
|
[LANG] Various (relatively minor) improvements (#320)
|
2021-10-04 18:39:40 -07:00 |
|
Philippe Tillet
|
2c287544cb
|
[OPS] Faster and cleaner block-sparse implementation (#311)
|
2021-09-27 18:25:16 -07:00 |
|
Benjamin Lefaudeux
|
b53f5f3803
|
[OPS][BLOCKSPARSE] safeguarding a couple more configurations (#292)
|
2021-09-20 17:15:31 -07:00 |
|
Philippe Tillet
|
3e395bc84e
|
[LANG] Fixed semantics of NaN in float comparisons (#281)
|
2021-09-13 15:06:29 -07:00 |
|