Yan Da
1c52bd587d
Device function & PassManager
2022-04-15 14:41:57 +08:00
Keren Zhou
f51e0b1be4
[FRONTEND] Hot fix for lineno ( #481 )
...
Override __reduce__ to make CompilationError pickable and print out error messages
2022-04-12 13:02:33 +08:00
Yan Da
7e0fd97965
Add set_attr(...) to ir.OpState
2022-04-11 12:26:54 +08:00
Yan Da
4eb062f313
fix issues in visit_If
2022-04-10 16:28:45 +08:00
Yan Da
fcbbb3c10e
Fix visit_While issues
2022-04-10 16:16:13 +08:00
Yan Da
19f81b7dea
Add scf-codegen tests
2022-04-10 15:49:09 +08:00
Yan Da
9c7b3d5173
Manage insertion block with context manager
2022-04-10 15:02:12 +08:00
Yan Da
aa6e086881
Add more comments
2022-04-10 14:36:03 +08:00
Yan Da
6002340456
Better textual representation
2022-04-07 20:44:41 +08:00
Yan Da
62f772123c
now kernel functions return nothing (instead of none)
2022-04-07 20:22:17 +08:00
Yan Da
040a2b6c75
Fix OpBuilder
2022-04-07 20:01:31 +08:00
Yan Da
6b4da6f016
Documentation
2022-04-07 16:00:53 +08:00
Yan Da
9cf4107990
Add TensorSizeTrait
2022-04-07 15:18:43 +08:00
Yan Da
39fad2b18a
More progress on WhileOp
2022-04-05 17:55:43 +08:00
Yan Da
d7fbddc7d4
Fix ret::reference issue
2022-04-05 16:09:09 +08:00
Yan Da
c7ad928e60
More progress on WhileOp codegen
2022-04-05 15:55:48 +08:00
Yan Da
0f96da336a
codegen for If
2022-04-04 12:58:37 +08:00
Yan Da
9df899b291
Some progress on visit_If
2022-04-03 22:34:46 +08:00
Yan Da
c71c50cd0c
ForOp's SSA construction
2022-04-03 19:11:47 +08:00
Yan Da
61413b8a97
More python bindings
2022-04-01 22:22:39 +08:00
Yan Da
9dafa0e2e3
Update trtion dependencies
2022-04-01 20:16:07 +08:00
Yan Da
bde103fab0
Replace MlirType with mlir::Type
2022-04-01 18:46:46 +08:00
Yan Da
4ad432f1fc
More on scf Ops
2022-03-31 21:42:48 +08:00
Yan Da
2041b67fbf
Now vecadd works
2022-03-30 20:21:47 +08:00
Yan Da
e381dc72c5
Use mlir::Block to replace MlirBlock
2022-03-30 16:31:03 +08:00
Yan Da
e95d98a886
bindings for ModuleOp
2022-03-30 13:32:52 +08:00
Yan Da
38e67b4293
Add more Ops
2022-03-28 19:50:23 +08:00
Yan Da
0d139ec460
Introducing SCF
2022-03-26 17:02:32 +08:00
Yan Da
c53f3486e4
create shr
2022-03-26 16:41:49 +08:00
Yan Da
ba16116f96
Let python manage created objects
2022-03-26 16:31:01 +08:00
Yan Da
fed9925bbd
Using stable LLVM release
2022-03-26 16:25:18 +08:00
Yan Da
a17fba86b1
Logic Op creation
2022-03-26 16:16:20 +08:00
Yan Da
d5612333c0
More fcmp ops
2022-03-25 14:12:20 +08:00
Yan Da
07881b4d41
Update includes
2022-03-24 13:46:35 +08:00
Yan Da
cf7fc8d642
Update includes
2022-03-24 13:33:54 +08:00
Yan Da
14a71dcb6f
Replace MlirOperation with MlirValue
2022-03-23 13:31:14 +08:00
Yan Da
f2ab318614
New python binding
2022-03-22 21:53:22 +08:00
Yan Da
419bbe0f6e
Reverts back to MLIR 14 & updates CMakeLists
2022-03-20 16:41:48 +08:00
Yan Da
a2c31ff434
Init commit
2022-03-17 20:40:55 +08:00
daadaada
539961072c
[FRONTEND] Semantic analysis refactor ( #473 )
...
Moved dispatch.cc to semantic.py
Integer signedness now moved from C++ to python
Cleaner frontend type
Co-authored-by: Phil Tillet <phil@openai.com >
2022-03-16 21:25:30 -07:00
Yongjik Kim
0dd2ec2e3a
[FRONTEND] Add an assert in case we get a CPU tensor. ( #478 )
2022-03-16 14:38:56 -07:00
Philippe Tillet
d4d8eaf6c0
[FRONTEND] improved caching mechanism ( #474 )
...
Co-authored-by: Greg Brockman <gdb@gregbrockman.com >
Co-authored-by: Christopher Hesse <christopherhesse@users.noreply.github.com >
2022-03-15 12:20:51 -07:00
Philippe Tillet
98ed7db8c1
[CODEGEN] Improvements and bugfixes ( #463 )
2022-02-24 14:56:24 -08:00
daadaada
a9dfdcaaa9
[FRONTEND] Make the performance model work for int8, tf32, and fp32 ( #456 )
2022-02-11 22:34:42 -08:00
Philippe Tillet
9b100302d3
[FRONTEND] Now using pybind11 to release GIL ( #458 )
2022-02-10 01:57:39 -08:00
Philippe Tillet
7b48340ffd
[CI] Some fixes for the build ( #451 )
2022-02-06 19:11:33 -08:00
Philippe Tillet
5a8a544d10
[OPS][BLOCKSPARSE] Improved robustness, clarity and performance ( #450 )
...
* dds layout now internally re-uses dsd code path for increased code
* at_mask and kp_mask related things are now dropped from the softmax API. I couldn't think of any case where it was needed beyond is_causal. And if there is any, we should probably find a way to get it implemented statically so that users don't have to materialize masks.
* fixed bug in blocksparse matmul that caused troubles when layout had a full row/col of zeros
* blocksparse softmax now no longer modifies any data in-place
* blocksparse softmax now takes an is_dense arguments that provides better performance. Passing is_dense=True, is_causal=True is the best way to achieve triangular attention.
* unit tests now test backward pass
2022-02-06 18:00:45 -08:00
TC
137bb67fad
[LANG] Add fp16 to fp8 conversion ( #444 )
2022-02-02 20:42:09 -08:00
Philippe Tillet
b0d6e2f322
[STYLE] run autopep
2022-01-30 20:27:44 -08:00
Philippe Tillet
2922dc141c
Merge branch 'master' into v2.0
2022-01-30 20:25:01 -08:00