
Conversation

@shino16 (Collaborator) commented Nov 26, 2025

Fixes #2768. The primary role of prims.update_aliases is to establish a relative ordering between bsyms that involve aliases and mutation. But when that ordering is already established by a functional dependency, inserting prims.update_aliases does no good and only causes unnecessary fusion breaks.

Since variable substitution via swap_map creates extra functional dependencies, it should be performed before deciding whether prims.update_aliases needs to be inserted. This eliminates the unnecessary prims.update_aliases calls.

There is one exception: in the current behavior we insert prims.update_aliases before every in-place op, and we must keep doing so regardless. nvFuser restricts mutation on tensors that are not inputs to the fused region, and we have no mechanism to comply with this rule other than to break fusion for every mutation. See #2768 (comment) for details.
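A minimal sketch of the distinction (illustrative only; this is not code from this PR):

```python
import torch, thunder

def f(a):
    b = a.clone()
    b.add_(1)      # in-place op: still preceded by update_aliases (nvFuser fusion break)
    return b * 2   # consumes b, so dataflow alone already orders this after add_;
                   # no additional update_aliases is needed between these two bsyms

jf = thunder.jit(f)
jf(torch.randn(4))
print(thunder.last_traces(jf)[-1])  # inspect how many update_aliases remain
```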

@shino16 force-pushed the remove-excess-udpate_aliases branch from 18c18a7 to 92663e5 on November 26, 2025 17:00
Copilot AI (Contributor) left a comment

Pull request overview

This pull request aims to remove unnecessary repetition of update_aliases calls in the alias update logic. The changes include:

  • Refactoring the condition logic in insert_alias_updates to handle inplace operations differently
  • Adding special handling for inplace ops to force insertion of update_aliases (as a workaround for nvFuser limitations)
  • Adding a new test to verify the expected number of update_aliases calls

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| thunder/core/update_aliases.py | Refactors the alias update insertion logic, moving the from_bsym_swap_proxies call earlier and adding special handling for inplace operations |
| thunder/tests/test_update_aliases.py | Adds test test_update_aliases_count to verify the expected number of update_aliases calls for functions with different numbers of inplace operations |
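
A hypothetical shape for such a count-checking test (the function, the counting method, and the expected count below are illustrative assumptions, not the PR's actual test code):

```python
import torch
import thunder

def test_update_aliases_count():
    def f(x):
        x.add_(1)
        x.mul_(2)
        return x

    jf = thunder.jit(f)
    jf(torch.ones(3))
    trace = thunder.last_traces(jf)[-1]
    # Count the update_aliases bound symbols remaining in the final trace.
    n = sum(1 for bsym in trace.bound_symbols if bsym.sym.name == "update_aliases")
    assert n == 2  # assumed: exactly one fusion break per in-place op
```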
Comments suppressed due to low confidence (1)

thunder/core/update_aliases.py:192

  • Potential double-swapping issue: bsym is swapped on line 168 with skip_output=True, then later on line 192, from_bsym_swap_proxies(swap_map) is called again on the same (already-swapped) bsym with an updated swap_map. This could lead to incorrect proxy substitution. Consider storing the original bsym before swapping on line 168, or restructuring the logic to avoid calling from_bsym_swap_proxies twice on the same symbol. (A possible restructuring is sketched after the excerpt below.)
```python
    for bsym in computation_trace.bound_symbols:
        if _is_inplace_op(bsym) or _is_view_creation_op(bsym) or _involves_viewed_args(bsym, viewed):
            bsym = bsym.from_bsym_swap_proxies(swap_map, skip_output=True)
            in_tensors = list(map(variableify, filter(lambda p: isinstance(p, TensorProxy), bsym.flat_proxy_args)))
            if _is_inplace_op(bsym) and in_tensors:
                in_tensors = {in_tensors[0]}
            else:
                in_tensors = set(in_tensors)
            out_tensors = set(map(variableify, filter(lambda p: isinstance(p, TensorProxy), bsym.flat_proxy_outs)))
            encountered.update(in_tensors)
            group = set(reduce(set.union, filter(lambda g: any(g.intersection(in_tensors)), view_groups), set()))
            views_encountered = group.intersection(encountered)

            if _is_inplace_op(bsym):
                # Super-hacky workaround to insert fusion break because nvFuser doesn't support mutation on intermediates
                # See https://github.com/Lightning-AI/lightning-thunder/issues/2768#issuecomment-3581908434
                views_encountered = in_tensors

            if not views_encountered:
                # This is a view creation with operands that are not involved in any inplace ops.
                bsyms.append(bsym)
                continue

            new_aliases = _get_new_aliases(views_encountered, computation_trace)
```
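
One way to address this (a hypothetical sketch along the lines of the reviewer's suggestion, not necessarily the fix that landed) is to keep the unswapped symbol around and derive every swapped version from it, so proxies are substituted exactly once:

```python
# Hypothetical restructuring, reusing names from the excerpt above.
orig_bsym = bsym  # keep the original, never-swapped symbol
bsym = orig_bsym.from_bsym_swap_proxies(swap_map, skip_output=True)
# ... compute in_tensors / out_tensors, update swap_map with the new aliases ...
# Swap from the original symbol instead of re-swapping the already-swapped one:
bsyms.append(orig_bsym.from_bsym_swap_proxies(swap_map))
```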



@shino16 marked this pull request as ready for review on November 26, 2025 18:15
@shino16 changed the title from "[WIP] Remove unnecessary repetition of update_aliases" to "Remove unnecessary repetition of update_aliases" on Nov 26, 2025
@crcrpar (Collaborator) left a comment

> nvFuser restricts mutation on tensors that are not inputs to the fused region, and we have no mechanism to comply with this rule other than to break fusion for every mutation.

@jjsjann123 is there a plan to relax this?

@mattteochen (Collaborator) left a comment

lgtm, thanks!

@shino16 (Author) commented Nov 27, 2025

The test failures are due to #2776.

@shino16 (Author) commented Nov 27, 2025

They will be fixed by #2777.

@beverlylytle (Collaborator) left a comment

This is a good change! However, there will be (hopefully resolvable) conflicts with #2769. Which should be merged first? (Ticking the request changes box to forestall merging before this question is answered.)

@shino16 (Author) commented Dec 4, 2025

I would prefer merging this PR first. There appears to be no conflict between these two PRs, but I will take extra care.

@shino16 (Author) commented Dec 5, 2025

I found a regression:

```python
import torch, thunder

def fn(a):
    b = a * 2
    c = b[:]     # view of the intermediate b
    c.tanh_()    # in-place op on the view should also be reflected in b
    return a * b

jfn = thunder.jit(fn)
x = torch.randn(6, device="cpu", requires_grad=True)
y = jfn(x)
y_ref = fn(x)
print(thunder.last_traces(jfn)[-1])
torch.testing.assert_close(y, y_ref)
# AssertionError: Tensor-likes are not close!
```

Since this follows the pattern of "mutate a view -> use its alias", I see this as part of #2766. The only difference is that this repro mutates an intermediate, not an input. I added this as an xfailed test.

This will be fixed by @beverlylytle's draft #2769 (or the patch #2766 (comment)).

@shino16 (Author) commented Dec 12, 2025

CI failures at 4cee1ad:

```text
____________________ test_complex_backward_custom_autograd _____________________

...

        jf = thunder_jit(f, fusion_type="dataflow")

        x = torch.ones(2, 3, device="cuda", requires_grad=True)

        # This should not raise an error about variables referenced before assignment.
>       jf(x)

thunder/tests/test_jit_general.py:1233:

...

        cse_trace.bound_symbols = list(filterfalse(lambda a: a is None, new_symbols))

        return_bsym = cse_trace.bound_symbols[-1]
>       assert return_bsym.sym.id == prims.PrimIDs.RETURN
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
E       AssertionError

thunder/executors/nvfuserex_impl.py:821: AssertionError
```

This error is meant to be fixed by #2777, but I want this PR to be merged before testing #2777.

I implemented a quick workaround that forcefully puts the return bsym at the end of the trace.

This PR should be ready to merge once the CI passes.

Comment on lines 820 to 825 in thunder/executors/nvfuserex_impl.py:

```diff
-    return_bsym = cse_trace.bound_symbols[-1]
-    assert return_bsym.sym.id == prims.PrimIDs.RETURN
+    return_bsym = None
+    for idx, bsym in enumerate(cse_trace.bound_symbols):
+        if bsym.sym.id == prims.PrimIDs.RETURN:
+            return_bsym = cse_trace.bound_symbols.pop(idx)
+            break
+    assert return_bsym is not None
```
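
Presumably the popped bound symbol is then re-appended so that RETURN ends up last (an assumption based on the "put return bsym at the end of the trace" description above; the re-append is not shown in this hunk):

```python
cse_trace.bound_symbols.append(return_bsym)  # assumed follow-up: put RETURN back at the end
```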
Collaborator left a comment

I looked at the code changes before reading the discussion, and this was very alarming to me. Could you add a TODO comment noting that this should be removed?

@shino16 (Author) replied

Sure, I appreciate that kind of feedback. Yes, it's a rough solution indeed...

@shino16 (Author) commented Dec 16, 2025

Hi @beverlylytle, do you think this PR is good to merge? We must wait for #2805 for CI though.

@beverlylytle (Collaborator)

Yes, I think it is.

@shino16 (Author) commented Dec 17, 2025

@KaelanDt This PR is ready for your stamp. Thank you!

```python
if not group or not (views_encountered := group.intersection(encountered)):
    # If group is empty, this is a view creation with operands that are not involved in any inplace ops.
    bsyms.append(bsym.from_bsym_swap_proxies(swap_map, skip_output=True))
involved_view_groups = [g for g in view_groups if g.intersection(unswapped_in_tensors)]
```
Collaborator left a comment

qq: wouldn't this call g.intersection len(view_groups) times?
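
For reference, a hypothetical way to avoid intersecting with every group (illustrative only; whether the repeated intersections matter in practice is not established here) would be to build a tensor-to-group index once and look the inputs up directly:

```python
# Hypothetical alternative, reusing names from the hunk above.
group_of = {t: i for i, g in enumerate(view_groups) for t in g}  # tensor -> group index
hit = {group_of[t] for t in unswapped_in_tensors if t in group_of}
involved_view_groups = [view_groups[i] for i in sorted(hit)]  # preserve original group order
```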

Successfully merging this pull request may close these issues:

Too many update_aliases after in-place op