[flang][NFCI] Stop tracking memory source after a load in a more explicit manner. #126156

Renaud-K · 2025-02-06T23:48:06Z

Typically, we do not track memory sources after a load because of the dynamic nature of the load and the fact that the alias analysis is a simple static analysis.

However, the code is written in a way that makes it seem like we are continuing to track memory but in reality we are only doing so when we know that the tracked memory is a leaf and therefore when there will only be one more iteration through the switch statement. In other words, we are iterating one more time, to gather data about a box, anticipating that this will be the last time. This is a hack that helped avoid cut-and-paste from other case statements but gives the wrong impression about the intention of the code and makes it confusing.

To make it clear that there is no more tracking, we gather all the necessary data from the memref of the load, in the case statement for the load, and exit the loop. I am also limiting this data gathering for the case when we load a box reference while we were actually following data, as tests have shows, is the only case when we need it for. Other cases will be handled conservatively, but this can change in the future, on a case-by-case basis.

llvmbot · 2025-02-06T23:48:38Z

@llvm/pr-subscribers-flang-fir-hlfir

Author: Renaud Kauffmann (Renaud-K)

Changes

Typically, we do not track memory sources after a load because of the dynamic nature of the load and the fact that the alias analysis is a simple static analysis.

However the code is written in a way that makes it seem like we are continuing to track memory but in reality we are only doing so when we know that the tracked memory is a leaf and therefore that there will only be one more iteration through the switch statement. In other words, we are iterating one more time, to gather data about a box, anticipating that this will be the last time. This is a hack that helped avoid cut-and-paste from other case statements but gives the wrong impression about the intention of the code and makes it confusing.

To make it clear that there is no more tracking, we gather all the necessary data from the memref of the load, in the case statement for the load, and exit the loop. I am also limiting this data gathering for the case when we load a box reference while we were actually following data, as tests have shows, is the only case when we need it for. Other cases will be handled conservatively, but this can change in the future, on a case-by-case basis.

Full diff: https://github.com/llvm/llvm-project/pull/126156.diff

2 Files Affected:

(modified) flang/lib/Optimizer/Analysis/AliasAnalysis.cpp (+37-11)
(modified) flang/test/Analysis/AliasAnalysis/alias-analysis-2.fir (+4-2)

diff --git a/flang/lib/Optimizer/Analysis/AliasAnalysis.cpp b/flang/lib/Optimizer/Analysis/AliasAnalysis.cpp
index 01f3a0326db216e..6f65ca18c3074bf 100644
--- a/flang/lib/Optimizer/Analysis/AliasAnalysis.cpp
+++ b/flang/lib/Optimizer/Analysis/AliasAnalysis.cpp
@@ -51,7 +51,7 @@ static bool hasGlobalOpTargetAttr(mlir::Value v, fir::AddrOfOp op) {
       v, fir::GlobalOp::getTargetAttrName(globalOpName));
 }
 
-mlir::Value getOriginalDef(mlir::Value v) {
+static mlir::Value getOriginalDef(mlir::Value v) {
   mlir::Operation *defOp;
   bool breakFromLoop = false;
   while (!breakFromLoop && (defOp = v.getDefiningOp())) {
@@ -578,16 +578,7 @@ AliasAnalysis::Source AliasAnalysis::getSource(mlir::Value v,
             breakFromLoop = true;
         })
         .Case<fir::LoadOp>([&](auto op) {
-          // If the load is from a leaf source, return the leaf. Do not track
-          // through indirections otherwise.
-          // TODO: Add support to fir.alloca and fir.allocmem
-          auto def = getOriginalDef(op.getMemref());
-          if (isDummyArgument(def) ||
-              def.template getDefiningOp<fir::AddrOfOp>()) {
-            v = def;
-            defOp = v.getDefiningOp();
-            return;
-          }
+          
           // If load is inside target and it points to mapped item,
           // continue tracking.
           Operation *loadMemrefOp = op.getMemref().getDefiningOp();
@@ -600,6 +591,41 @@ AliasAnalysis::Source AliasAnalysis::getSource(mlir::Value v,
             defOp = v.getDefiningOp();
             return;
           }
+
+          // If we are loading a box reference, but following the data,
+          // we gather the attributes of the box to populate the source
+          // and stop tracking.
+          if (auto boxTy = mlir::dyn_cast<fir::BaseBoxType>(ty);
+              boxTy && followingData) {
+
+            if (mlir::isa<fir::PointerType>(boxTy.getEleTy())) {
+              attributes.set(Attribute::Pointer);
+            }
+
+            auto def = getOriginalDef(op.getMemref());
+            if (auto addrOfOp = def.template getDefiningOp<fir::AddrOfOp>()) {
+              global = addrOfOp.getSymbol();
+
+              if (hasGlobalOpTargetAttr(def, addrOfOp))
+                attributes.set(Attribute::Target);
+
+              type = SourceKind::Global;
+            }
+
+            // TODO: Add support to fir.alloca and fir.allocmem
+            // if (auto allocOp = def.template getDefiningOp<fir::AllocaOp>()) {
+            //   ...
+            // }
+
+            if (isDummyArgument(def)) {
+              defOp = nullptr;
+              v = def;
+            }
+            
+            breakFromLoop = true;
+            return;
+          }
+
           // No further tracking for addresses loaded from memory for now.
           type = SourceKind::Indirect;
           breakFromLoop = true;
diff --git a/flang/test/Analysis/AliasAnalysis/alias-analysis-2.fir b/flang/test/Analysis/AliasAnalysis/alias-analysis-2.fir
index ca97c5900281d64..3fbf29ab2eb2926 100644
--- a/flang/test/Analysis/AliasAnalysis/alias-analysis-2.fir
+++ b/flang/test/Analysis/AliasAnalysis/alias-analysis-2.fir
@@ -49,11 +49,13 @@
 
 // TODO: Can the address in a pointer alias the address of a pointer, even when the
 // pointer has no box. Should this be NoAlias?
-// T3: CHECK-DAG: p1.addr#0 <-> p1.tgt#0: MayAlias
+// T3: 
+// CHECK-DAG: p1.addr#0 <-> p1.tgt#0: MayAlias
 
 // The addresses stored in two different pointers can alias, even if one has no
 // box.  In this program, they happen to be the same address.
-// T4: CHECK-DAG: p1.tgt#0 <-> boxp1.addr#0: MayAlias
+// T4: 
+// CHECK-DAG: p1.tgt#0 <-> boxp1.addr#0: MayAlias
 
 func.func @_QFPtest(%arg0: !fir.ref<f32> {fir.bindc_name = "v1", fir.target}, %arg1: !fir.ref<f32> {fir.bindc_name = "v2", fir.target}, %arg2: !fir.ref<!fir.box<!fir.ptr<f32>>> ) attributes {test.ptr = "func"} {

github-actions · 2025-02-06T23:55:26Z

✅ With the latest revision this PR passed the C/C++ code formatter.

jeanPerier

Thanks Renaud, this makes sense to me. I wonder however if you could not use getSource recursively and decide what attributes must be propagated to the address being analyzed from the address that contains it.

That way, you do not have to implement the walk/attribute gathering again, but only need to decide the "attribute propagation" logic.

jeanPerier · 2025-02-07T09:11:58Z

flang/lib/Optimizer/Analysis/AliasAnalysis.cpp

+
+            if (mlir::isa<fir::PointerType>(boxTy.getEleTy())) {
+              attributes.set(Attribute::Pointer);
+            }


nit: no braces here.

jeanPerier · 2025-02-07T09:24:16Z

flang/lib/Optimizer/Analysis/AliasAnalysis.cpp

+                attributes.set(Attribute::Target);
+
+              type = SourceKind::Global;
+            }


Aren't here other cases where the Target attribute should be set (local/dummy allocatables with TARGET attribute for instance)?

Why not calling getSource on op.getMemref() and propagating the TARGET/POINTER attribute/whatever is relevant to propagate from the variable containing the address to the address?

The plan was to be doing this in getOriginalDef as in #117785 which has the tests that expose the need for it.
I could do it here, if you have concerns that we might be losing something.

Though, I have added a test that shows that we are reading the target attribute properly. It's only when we add support for local boxes that the attribute needs to be extracted from fir.declare. This is coming.

jeanPerier · 2025-02-07T09:25:24Z

flang/lib/Optimizer/Analysis/AliasAnalysis.cpp

@@ -600,6 +590,41 @@ AliasAnalysis::Source AliasAnalysis::getSource(mlir::Value v,
            defOp = v.getDefiningOp();
            return;


Any idea why the OpenMP case should not follow the same logic and continue the walk?

This is why I added @DominikAdamski to the review. To show that we want to set a better precedent for loads and see if maybe something similar can be done for Omp. I did not quite follow the need for it.

Renaud-K · 2025-02-07T16:36:11Z

Using getSource recursively, is a bit what got us into trouble because it is stretching the original goal of the function which is designed to get the source. To gather information about the memref we have getOriginalDef which is much more limited in scope. In #117785 getOriginalDef will also be gathering more information.

…e on globals

flang/test/Analysis/AliasAnalysis/alias-analysis-2.fir

jdenny-ornl · 2025-02-07T18:22:46Z

flang/lib/Optimizer/Analysis/AliasAnalysis.cpp

+          if (auto boxTy = mlir::dyn_cast<fir::BaseBoxType>(ty);
+              boxTy && followingData) {


Is a goal of this new condition to ensure cases like fir.alloca !fir.ptr<T> and !fir.ref<!fir.ptr<T>> will never be handled by this code? Thus, T3 and T4 will stay MayAlias, even after we extend this for alloca? If so, that works for my current use cases.

For now (before we extend for alloca), is this PR intended to be NFC?

It does do that but it is not really the intention. The functionality should remain vastly the same, so maybe NFCI. Before and after, the load of a fir.alloca will be a SourceKind::Indirect. If we are to extend data vs non-data, we will, it is just code. What is emerging from this change and the fact that everything still works with it, is a new idea for me: it could make sense to have a SourceKind::Emboxed memory source in the future. But this is a topic for a different discussion.

Point to where T3 is discussed. Co-authored-by: Joel E. Denny <[email protected]>

Slight comment change. Co-authored-by: Joel E. Denny <[email protected]>

Renaud-K requested review from tblah and DominikAdamski February 6, 2025 23:48

llvmbot added flang Flang issues not falling into any other category flang:fir-hlfir labels Feb 6, 2025

Renaud-K requested review from jeanPerier, jdenny-ornl and vzakhari February 6, 2025 23:48

Applying formatting

7299f21

Renaud-K force-pushed the aa-refactor branch from 6db23e2 to 7299f21 Compare February 7, 2025 00:11

jeanPerier reviewed Feb 7, 2025

View reviewed changes

Renaud-K added 2 commits February 7, 2025 08:55

Removing curly brackets

8d43bd4

Adding a test that makes sure that we are reading the target attribut…

3139abc

…e on globals

jdenny-ornl reviewed Feb 7, 2025

View reviewed changes

Renaud-K changed the title ~~[flang] Stop tracking memory source after a load in a more explicit manner.~~ [flang][NFCI] Stop tracking memory source after a load in a more explicit manner. Feb 7, 2025

Renaud-K and others added 2 commits February 7, 2025 11:21

Update flang/test/Analysis/AliasAnalysis/alias-analysis-2.fir

ff140e7

Point to where T3 is discussed. Co-authored-by: Joel E. Denny <[email protected]>

Update flang/test/Analysis/AliasAnalysis/alias-analysis-2.fir

7f7b43c

Slight comment change. Co-authored-by: Joel E. Denny <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[flang][NFCI] Stop tracking memory source after a load in a more explicit manner. #126156

[flang][NFCI] Stop tracking memory source after a load in a more explicit manner. #126156

Renaud-K commented Feb 6, 2025 •

edited

Loading

llvmbot commented Feb 6, 2025

github-actions bot commented Feb 6, 2025 •

edited

Loading

jeanPerier left a comment

jeanPerier Feb 7, 2025

jeanPerier Feb 7, 2025

Renaud-K Feb 7, 2025 •

edited

Loading

Renaud-K Feb 7, 2025

jeanPerier Feb 7, 2025

Renaud-K Feb 7, 2025

Renaud-K commented Feb 7, 2025

jdenny-ornl Feb 7, 2025

Renaud-K Feb 7, 2025 •

edited

Loading

		@@ -600,6 +590,41 @@ AliasAnalysis::Source AliasAnalysis::getSource(mlir::Value v,
		defOp = v.getDefiningOp();
		return;

		if (auto boxTy = mlir::dyn_cast<fir::BaseBoxType>(ty);
		boxTy && followingData) {

[flang][NFCI] Stop tracking memory source after a load in a more explicit manner. #126156

Are you sure you want to change the base?

[flang][NFCI] Stop tracking memory source after a load in a more explicit manner. #126156

Conversation

Renaud-K commented Feb 6, 2025 • edited Loading

llvmbot commented Feb 6, 2025

github-actions bot commented Feb 6, 2025 • edited Loading

jeanPerier left a comment

Choose a reason for hiding this comment

jeanPerier Feb 7, 2025

Choose a reason for hiding this comment

jeanPerier Feb 7, 2025

Choose a reason for hiding this comment

Renaud-K Feb 7, 2025 • edited Loading

Choose a reason for hiding this comment

Renaud-K Feb 7, 2025

Choose a reason for hiding this comment

jeanPerier Feb 7, 2025

Choose a reason for hiding this comment

Renaud-K Feb 7, 2025

Choose a reason for hiding this comment

Renaud-K commented Feb 7, 2025

jdenny-ornl Feb 7, 2025

Choose a reason for hiding this comment

Renaud-K Feb 7, 2025 • edited Loading

Choose a reason for hiding this comment

Renaud-K commented Feb 6, 2025 •

edited

Loading

github-actions bot commented Feb 6, 2025 •

edited

Loading

Renaud-K Feb 7, 2025 •

edited

Loading

Renaud-K Feb 7, 2025 •

edited

Loading