fix: SanityCheckPlan error with window functions and NVL filter #20231

EeshanBembi · 2026-02-09T10:48:00Z

Which issue does this PR close?

Closes #20194

Rationale for this change

A query with ROW_NUMBER() OVER (... ORDER BY CASE WHEN col='0' THEN 1 ELSE 0 END) combined with a filter nvl(t2.value_2_3,'0')='0' fails with a SanityCheckPlan error. This worked in 50.3.0 but broke in 52.1.0.

What changes are included in this PR?

Root cause: collect_columns_from_predicate_inner was extracting equality pairs where neither side was a Column (e.g. nvl(col, '0') = '0'), creating equivalence classes between complex expressions and literals. normalize_expr's deep traversal would then replace the literal '0' inside unrelated sort/window CASE WHEN expressions with the complex NVL expression, corrupting the sort ordering and causing a mismatch between SortExec's reported output ordering and BoundedWindowAggExec's expected ordering.

Fix (two changes in filter.rs):

collect_columns_from_predicate_inner: Only extract equality pairs where at least one side is a Column reference. This matches the function's documented intent ("Column-Pairs") and prevents complex-expression-to-literal equivalence classes from being created.
extend_constants: Recognize Literal expressions as inherently constant (previously only checked is_expr_constant on the input's equivalence properties, which doesn't know about literals). This ensures constant propagation still works for complex_expr = literal predicates — e.g. nvl(col, '0') is properly marked as constant after the filter.

How was this tested?

Unit test test_collect_columns_skips_non_column_pairs verifying the filtering logic
Sqllogictest reproducing the exact query from the issue
Full test suites: equivalence tests (51 passed), physical-plan tests (1255 passed), physical-optimizer tests (20 passed)
Manual verification with datafusion-cli running the reproduction query

Test plan

Unit test for collect_columns_from_predicate_inner column filtering
Sqllogictest regression test for SanityCheckPlan caused by Error during planning: #20194
Existing test suites pass
Manual reproduction query succeeds

…he#20194) `collect_columns_from_predicate_inner` was extracting equality pairs where neither side was a Column (e.g. `nvl(col, '0') = '0'`), creating equivalence classes between complex expressions and literals. `normalize_expr`'s deep traversal would then replace the literal inside unrelated sort/window expressions with the complex expression, corrupting the sort ordering and triggering SanityCheckPlan failures. Fix by restricting `collect_columns_from_predicate_inner` to only extract pairs where at least one side is a Column reference, matching the function's documented intent. Also update `extend_constants` to recognize Literal expressions as inherently constant, so constant propagation still works for `complex_expr = literal` predicates. Closes apache#20194

github-actions bot added sqllogictest SQL Logic Tests (.slt) physical-plan Changes to the physical-plan crate labels Feb 9, 2026

EeshanBembi force-pushed the fix/sanity-check-nvl-equivalence-20194 branch from 5543f1f to 244be39 Compare February 9, 2026 10:57

EeshanBembi marked this pull request as ready for review February 9, 2026 14:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: SanityCheckPlan error with window functions and NVL filter #20231

fix: SanityCheckPlan error with window functions and NVL filter #20231

EeshanBembi commented Feb 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fix: SanityCheckPlan error with window functions and NVL filter #20231

Are you sure you want to change the base?

fix: SanityCheckPlan error with window functions and NVL filter #20231

Conversation

EeshanBembi commented Feb 9, 2026

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

How was this tested?

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant