[CIR][AArch64][Lowering] Support fields with structs containing constant arrays or pointers #1136

bruteforceboy · 2024-11-18T14:25:57Z

This PR adds AArch64 support for function arguments whose struct types contain constant arrays or pointers.

For example,

```
typedef struct {
  int a[42];
} CAT;

void pass_cat(CAT a) {}
```

As usual, the main ideas are taken from the original CodeGen, and I have added a couple of tests.
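The example in the description covers the constant-array case. A minimal sketch of both field kinds the title mentions might look like the following; the type and function names (`ARR_S`, `PTR_S`, `first_of_arr`, `deref_ptr`) are hypothetical and are not taken from the tests added in this PR.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical types for illustration only; not the PR's test inputs. */
typedef struct {
  int a[42]; /* constant-size array field */
} ARR_S;

typedef struct {
  int *p; /* pointer field */
} PTR_S;

/* The array lives inside the struct, so the struct is at least that large. */
static_assert(sizeof(ARR_S) >= 42 * sizeof(int), "array is stored inline");

/* Receives the array-carrying struct by value and reads its first element. */
int first_of_arr(ARR_S s) { return s.a[0]; }

/* Receives the pointer-carrying struct by value and dereferences the field. */
int deref_ptr(PTR_S s) { return s.p != NULL ? *s.p : 0; }
```

Both functions take their struct argument by value, which is the situation the call convention lowering has to classify.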

The loop was erasing the user of a value while iterating on the value's users, which results in a use after free. We're already assuming (and asserting) that there's only one user, so we can just access it directly instead. CIR/Transforms/Target/x86_64/x86_64-call-conv-lowering-pass.cpp was failing with ASAN before this change. We're now ASAN-clean except for llvm#829 (which is also in progress).

Reland llvm#638

This was reverted due to llvm#655. I tried to address the problem in the newest commit.

The changes in this PR since the last landed version include:

- Move the definition of `cir::CIRGenConsumer` to `clang/include/clang/CIRFrontendAction/CIRGenConsumer.h` and leave its `HandleTranslationUnit` interface empty, so that `cir::CIRGenConsumer` no longer needs to depend on CodeGen.
- Change the old definition of `cir::CIRGenConsumer` in `clang/lib/CIR/FrontendAction/CIRGenAction.cpp` to `CIRLoweringConsumer`, inherited from `cir::CIRGenConsumer`, which implements the original `HandleTranslationUnit` interface.

I feel this may improve readability even without my original patch.

This PR fixes the lowering for multi-dimensional arrays.

Consider the following code snippet `test.c`:

```
void foo() {
  char arr[4][1] = {"a", "b", "c", "d"};
}
```

When run with `bin/clang test.c -Xclang -fclangir -Xclang -emit-llvm -S -o -`, it produces the following error:

```
~/clangir/llvm/include/llvm/Support/Casting.h:566: decltype(auto) llvm::cast(const From&) [with To = mlir::ArrayAttr; From = mlir::Attribute]: Assertion `isa<To>(Val) && "cast<Ty>() argument of incompatible type!"' failed.
```

The bug can be traced back to `LoweringHelpers.cpp`. It treats the values in the array as integer types, which causes an error in this case. This PR updates `convertToDenseElementsAttrImpl` to handle arrays that contain string attributes. I have also added one more similar test. Note that in the tests I used a **literal match** to avoid matching as regex, so `!dbg` is useful.


@smeenai


As title. Also introduced a buildAArch64NeonCall skeleton, which is partially the counterpart of OG's EmitNeonCall. This could be used for many other neon intrinsics. --------- Co-authored-by: Guojin He <[email protected]>


These were uninitialized, which led to intermittent test failures from the use of uninitialized variables. Initialize them to `nullptr` as is done with other member variables that are pointers to fix this. I did a quick spot-check and didn't find other uninitialized variables in the main CGF class itself. Lots of subclasses have uninitialized member variables, but those are presumably expected to be initialized at all points of construction, so we can leave them alone until they cause any issues. `ninja check-clang-cir` now passes with ASan+UBSan and MSan. Fixes llvm#829


This PR adds aarch64 big endian support. The support for aarch64_be itself amounts to two extra cases in the switch statement; the changes in `CIRDataLayout` are needed to prove that we really support big endian. Hence the idea for the test: I think the best proof is something involving bit-fields, so we compare the results of the original codegen with ours.

This PR splits the old `cir-simplify` pass into two new passes, namely `cir-canonicalize` and `cir-simplify` (the new `cir-simplify`). The `cir-canonicalize` pass runs transformations that do not affect CIR-to-source fidelity much, such as operation folding and redundant operation elimination. On the other hand, the new `cir-simplify` pass runs transformations that may significantly change the code and break high-level code analysis passes, such as more aggressive code optimizations.

This PR also updates the CIR-to-CIR pipeline to fit these two new passes. The `cir-canonicalize` pass is moved to the very front of the pipeline, while the new `cir-simplify` pass is moved to the back of the pipeline (but still before lowering prepare, of course). Additionally, the new `cir-simplify` now only runs when the user specifies a non-zero optimization level on the frontend.

Also fixed some typos and resolved some `clang-tidy` complaints along the way.

Resolves llvm#827.


Mistakenly closed llvm#850 (review)

This PR fixes array initialization for expression arguments.

Consider the following code snippet `test.c`:

```
typedef struct {
  int a;
  int b[2];
} A;

int bar() {
  return 42;
}

void foo() {
  A a = {bar(), {}};
}
```

When run with `bin/clang test.c -Xclang -fclangir -Xclang -emit-cir -S -o -`, it produces the following error:

```
~/clangir/clang/lib/CIR/CodeGen/CIRGenExprAgg.cpp:483: void {anonymous}::AggExprEmitter::buildArrayInit(cir::Address, mlir::cir::ArrayType, clang::QualType, clang::Expr*, llvm::ArrayRef<clang::Expr*>, clang::Expr*): Assertion `NumInitElements != 0' failed.
```

The error can be traced back to `CIRGenExprAgg.cpp`, and the fix is simple. It is possible to have an empty array initialization as an expression argument!


This PR fixes the case where a temporary variable is used and the `alloca` operation is inserted at the start of the block, before the `label` operation. Implementation: when we search for the place to insert the `alloca` in a block, we take label operations into account as well. Fix llvm#870 --------- Co-authored-by: Bruno Cardoso Lopes <[email protected]>

`__attribute__((annotate()))` was only accepting integer literals, which prevented some meta-programming uses, for example. This should be extended to some other kinds of types. --------- Co-authored-by: Bruno Cardoso Lopes <[email protected]>


…lvm#878) Close llvm#876

We've already considered the case where there are random statements after a switch case:

```
for (auto *c : compoundStmt->body()) {
  if (auto *switchCase = dyn_cast<SwitchCase>(c)) {
    res = buildSwitchCase(*switchCase, condType, caseAttrs);
  } else if (lastCaseBlock) {
    // This means it's a random stmt following up a case, just
    // emit it as part of previous known case.
    mlir::OpBuilder::InsertionGuard guardCase(builder);
    builder.setInsertionPointToEnd(lastCaseBlock);
    res = buildStmt(c, /*useCurrentScope=*/!isa<CompoundStmt>(c));
  } else {
    llvm_unreachable("statement doesn't belong to any case region, NYI");
  }

  lastCaseBlock = builder.getBlock();

  if (res.failed())
    break;
}
```

However, and maybe this is an oversight, in the `if (lastCaseBlock)` branch the insertion point will be updated automatically when the RAII object `guardCase` is destroyed, then we can assign the correct value for `lastCaseBlock` later. So we will see the weird code pattern described in the issue.

BTW, I found that the code in CIRGenStmt.cpp is far less similar to the original than the code in other codegen places. Is this intentional? And what are the motivation and guidelines here?


…lvm#882) As title. Notice that for those intrinsics, just like OG, we do not lower to llvm intrinsics, instead, do vector insert. The test case is partially from OG [aarch64-neon-vget.c](https://github.com/llvm/clangir/blob/85bc6407f559221afebe08a60ed2b50bf1edf7fa/clang/test/CodeGen/aarch64-neon-vget.c) But, I did not do all signed and unsigned int tests because unsigned and signed of the same width essentially just use the same intrinsic ID thus exactly same code path as far as this PR concerns. --------- Co-authored-by: Guojin He <[email protected]>


This is going to be raised in follow-up work, which is hard to do in one go because createBaseClassAddr goes off the OG skeleton and ideally we want ApplyNonVirtualAndVirtualOffset to work naturally. This also doesn't handle null checks; that is coming next.


It was always the intention for `cir.cmp` operations to return a bool result. Due to missing constraints, a bug slipped into codegen that created `cir.cmp` operations whose result type matched the original AST expression type. In C, as opposed to C++, boolean expression types are "int". This resulted in extra operations being generated around boolean expressions and their uses. This commit both enforces the bool result type in the `cir.cmp` op definition and fixes the mentioned bug.
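The remark that boolean expressions have type "int" in C can be checked with C11 `_Generic`. This is only an illustration of the language rule, not code from the commit; the macro `IS_INT` and the two functions are hypothetical names.

```c
#include <assert.h>

/* Hypothetical helper: evaluates to 1 when the expression has type int. */
#define IS_INT(expr) _Generic((expr), int : 1, default : 0)

/* In C, relational and logical operators yield int, not a bool type. */
static_assert(IS_INT(1 < 2), "comparison result has type int in C");

int comparison_is_int(void) { return IS_INT(1 < 2); }
int logical_not_is_int(void) { return IS_INT(!0); }
```

The same check would fail if the file were compiled as C++, where these expressions have type `bool`.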

@bcardosolopes


This is the first patch to support TBAA, following the discussion at llvm#1076 (comment)

- add skeleton for CIRGen, utilizing `decorateOperationWithTBAA`
- add empty implementation in `CIRGenTBAA`
- introduce `CIR_TBAAAttr` with empty body
- attach `CIR_TBAAAttr` to `LoadOp` and `StoreOp`
- no handling of vtable pointer
- no LLVM lowering

The title describes the purpose of the PR. It adds initial support for structures with padding to the call convention lowering for AArch64. I have also added _initial support_ for the missing feature [FinishLayout](https://github.com/llvm/clangir/blob/5c5d58402bebdb1e851fb055f746662d4e7eb586/clang/lib/AST/RecordLayoutBuilder.cpp#L786) for records; the logic is taken from the original codegen. Finally, I added a test for verification.


smeenai

Should we also add a test for a struct containing a smaller array (which should be passed via registers instead of pointer)?

bruteforceboy · 2024-11-21T13:47:40Z

> Should we also add a test for a struct containing a smaller array (which should be passed via registers instead of pointer)?

IMHO, the test is okay as it is. The main idea of the PR is to add support for these types, which previously failed with an "NYI" error. The sizes, or how the struct is passed, don't really matter (for this PR at least).

smeenai

Checking for different array sizes would get some coverage for the size calculation e.g. making sure that int[4] is passed in registers but long[4] isn't. You're right that it's not strictly necessary here though.
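The size distinction mentioned here can be sketched as follows. Under the AArch64 procedure call standard, a composite larger than 16 bytes is passed indirectly, while smaller ones can go in registers. The code below is a simplified model of that size check only: it ignores the special cases for homogeneous floating-point and vector aggregates, and every name in it (`SMALL`, `LARGE`, `fits_in_registers`) is hypothetical rather than taken from the PR or its tests.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical test-style structs; fixed-width types keep the sizes exact. */
typedef struct {
  int32_t a[4]; /* 16 bytes */
} SMALL;

typedef struct {
  int64_t a[4]; /* 32 bytes */
} LARGE;

static_assert(sizeof(SMALL) == 16, "four 32-bit elements");
static_assert(sizeof(LARGE) == 32, "four 64-bit elements");

/* Simplified model: aggregates of up to 16 bytes may be passed in registers. */
enum { MAX_DIRECT_AGGREGATE_BYTES = 16 };

int fits_in_registers(size_t size_in_bytes) {
  return size_in_bytes <= MAX_DIRECT_AGGREGATE_BYTES;
}

/* By-value parameters of both kinds, as a lowering test might declare them. */
void pass_small(SMALL s) { (void)s; }
void pass_large(LARGE l) { (void)l; }
```

A test pair like `pass_small` and `pass_large` would exercise both sides of the size calculation the reviewer describes.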

smeenai and others added 30 commits November 2, 2024 23:32

[CIR][CodeGen] Support global temporaries

edfffa5

Support expressions at the top level such as const unsigned int n = 1234; const int &r = (const int&)n; Reviewers: bcardosolopes Pull Request: llvm#857

[CIR][CodeGen][NFC] Move GetUndefRValue to the right file

17d8d70

This is to match clang CodeGen

[CIR][CIRGen] Exceptions: lexical scope issue with global initializers

51bbb15

Fix llvm#829 Thanks @smeenai for pointing out the root cause and UBSan failure!

[CIR][CodeGen][NFC] Add TBAAAccessInfo stubbed out and many usages of…

c4e85ea

… it (llvm#859)

[CIR][CodeGen] Stub out an empty CIRGenDebugInfo type

0fc106c

[CIR][CIRGen] Implement Nullpointer arithmatic extension (llvm#861)

49ca6dd

See the test for example.

[CIR][CodeGen] Implement union cast (llvm#867)

83145a4

Currently the C style cast is not implemented/supported for unions. This PR adds support for union casts as done in `CGExprAgg.cpp`. I have also added an extra test in `union-init.c`.
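For readers unfamiliar with the construct, a cast to a union type is a GNU C extension accepted by GCC and Clang. The sketch below is illustrative only; the names `IntOrFloat` and `wrap_int` are hypothetical and are not taken from `union-init.c`.

```c
#include <assert.h>

/* Hypothetical union used only for illustration. */
typedef union {
  int i;
  float f;
} IntOrFloat;

static_assert(sizeof(IntOrFloat) >= sizeof(int), "union holds an int member");

/* The operand type must exactly match one of the union's member types. */
int wrap_int(int x) {
  IntOrFloat u = (IntOrFloat)x; /* GNU extension: cast to union */
  return u.i;
}
```

Here the cast initializes the `i` member, because `int` is the member type that matches the operand.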

[CIR][CIRGen] Exceptions: unlock nested try/catch support

4e66034

[CIR][CIRGen] Correct isSized predicate for vector type (llvm#869)

73490a3

As title: if the element type of a vector type is sized, then the vector type should be deemed sized. This would enable us to generate code for neon without triggering an assertion.

[CIR][CIRGen][Builtin][Neon] Lower builtin_neon_vrnda_v and builtin_n…

47d3674

…eon_vrndaq_v (llvm#871) as title. This also added NeonType support for Float32 Co-authored-by: Guojin He <[email protected]>

[CIR][CIRGen] Handle VisitCXXRewrittenBinaryOperator for scalars

7fbe3f4

[CIR][CIRGen][NFC] Cleanups: add skeleton for DominatingValue<RValue>…

ecdaa8d

…::saved_type::save

[CIR][Infra] Run check-clang-cir against any branch based PR (llvm#873)

671564f

[CIR][CIRGen][NFC] Cleanups: add more skeleton to pushFullExprCleanup

f713df2

It will hit another assert when calling initFullExprCleanup.

[CIR][CIRGen] Cleanups: handle conditional cleanups

f7c98af

Just as the title says, but only covers non-exception path, that's coming next.

[CIR][CIRGen][NFC] Cleanups: Prepare for conditional cleanup

d02a8d8

Nothing unblocked yet, just hit next assert in the same path.

[CIR][CIRGen][NFC] Cleanups: more boilerplate work for conditional on…

41bcfcd

… exceptions Code path still hits an assert sooner, incremental NFC step.

[CIR][CIRGen] Generate CIR for empty compound literal (llvm#880)

34f1e38

as title.

ghehg and others added 11 commits November 14, 2024 21:33

[CIR][CIRGen][Builtin][Neon] Lower neon_splat_lane, neon_splat_laneq…

8fbc640

…, neon_splatq_lane and neon_splatq_laneq (llvm#1126)

[CIR][CIRGen][Builtin] Support __builtin___memmove_chk (llvm#1106)

ab9fbcf

[CIR][NFC] Fix unused variable warning

29cb9bc

[CIR][CIRGen] Bring getAddressOfBaseClass a bit closer to OG

a61e202

[CIR][CIRGen][NFC] More unification of virtual and non-virtual offset…

2430c26

… paths

[CIR][CIRGen][NFC] More skeleton conformance

c10f493

Now that we fixed the dep on VBase, clean up the rest of the function.

[CIR][CIRGen] Teach all uses of ApplyNonVirtualAndVirtualOffset to us…

3aed38c

…e BaseClassAddrOp

initial commit

45a6e17

update tests

157d5fb

update tests

5810d6c

bruteforceboy changed the title ~~[CIR][AArch64][Lowering] Support Function Arguments With Structs Containing Constant Arrays or Pointers~~ [CIR][AArch64][Lowering] Support function arguments with structs containing constant arrays or pointers Nov 18, 2024

bruteforceboy changed the title ~~[CIR][AArch64][Lowering] Support function arguments with structs containing constant arrays or pointers~~ [CIR][AArch64][Lowering] Support fields with structs containing constant arrays or pointers Nov 18, 2024

bruteforceboy marked this pull request as ready for review November 18, 2024 14:35

bruteforceboy requested review from lanza and bcardosolopes as code owners November 18, 2024 14:35

orbiri and others added 2 commits November 18, 2024 08:57

[cherry-pick][mlir][llvm] Add support for memset.inline (#115711) (ll…

da601b3

…vm#1135) support `llvm.intr.memset.inline` in llvm-project repo before we add support for `__builtin_memset_inline` in clangir cc @bcardosolopes (cherry picked from commit 30753af)

smeenai requested review from gitoleg and sitio-couto November 19, 2024 05:27

PikachuHyA and others added 5 commits November 19, 2024 09:40

[CIR][CIRGen] Support __builtin_memset_inline (llvm#1114)

affa8f8

[CIR] fix deref nullptr when verify symbol for cir.get_global (llvm…

bae7bd9

…#1143)

Merge remote-tracking branch 'origin/main' into lower-context

43627ae

smeenai reviewed Nov 20, 2024


smeenai approved these changes Nov 21, 2024


smeenai force-pushed the main branch 2 times, most recently from 4aca8d4 to a04cf10 Compare November 23, 2024 06:11
