Improve `InListExpr` plan display by pepijnve · Pull Request #17884 · apache/datafusion

pepijnve · 2025-10-02T11:23:12Z

Which issue does this PR close?

Closes Spurious 'Use' in InList display output #17883.

Rationale for this change

Aligns the explain output for IN (SET) and NOT IN (SET). The presence of Use is a bit annoying.

What changes are included in this PR?

Removes `Use` from the formatted output
Adapts tests where necessary

Are these changes tested?

Covered by existing tests

Are there any user-facing changes?

The explain output strings change slightly

pepijnve · 2025-10-02T13:29:56Z

@Jefffrey I'm fixing the failing test. While I'm at it I'm considering adding another change in this PR that uses Display for the set elements rather than Debug. The output now is way too verbose imo.

Example from one of the test cases:

rather than

a@0 IN (SET) ([Literal { value: Utf8("a"), field: Field { name: "lit", data_type: Utf8, nullable: false, dict_id: 0, dict_is_ordered: false, metadata: {} } }, Literal { value: Utf8("b"), field: Field { name: "lit", data_type: Utf8, nullable: false, d ...

we would get

a@0 IN (SET) ([a, b, NULL])

Is that OK to include, or should I make a separate PR?
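The difference between the two outputs above comes down to which formatting trait is used for the set elements. The following is a minimal sketch of that idea, not the actual DataFusion code: `Lit`, `display_list`, `debug_list`, and `in_set_display` are hypothetical names introduced only for illustration.

```rust
use std::fmt;

// Hypothetical stand-in for a physical literal expression.
// This is NOT DataFusion's `Literal` type; it only mimics the idea.
#[derive(Debug)]
struct Lit {
    value: Option<String>,
    nullable: bool,
}

// `Display` prints only the value, with NULL for a missing value.
impl fmt::Display for Lit {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match &self.value {
            Some(v) => write!(f, "{v}"),
            None => write!(f, "NULL"),
        }
    }
}

// Formats the elements with `Display`, joined by ", " inside brackets.
fn display_list(items: &[Lit]) -> String {
    let parts: Vec<String> = items.iter().map(|i| i.to_string()).collect();
    format!("[{}]", parts.join(", "))
}

// Formats the elements with the derived `Debug`, which dumps every field.
fn debug_list(items: &[Lit]) -> String {
    format!("{items:?}")
}

// Assumed output shape, modelled on the strings quoted in this thread.
fn in_set_display(column: &str, negated: bool, items: &[Lit]) -> String {
    let op = if negated { "NOT IN" } else { "IN" };
    format!("{column} {op} (SET) ({})", display_list(items))
}
```

With three elements `a`, `b`, and a null, `in_set_display("a@0", false, &items)` yields `a@0 IN (SET) ([a, b, NULL])`, while `debug_list(&items)` prints every struct field of every element.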

Jefffrey · 2025-10-02T13:32:24Z

I think that's a great idea to include in this PR; I recall a recent PR where we saw this debug display too which was a bit ugly: #17732 (comment)

alamb · 2025-10-02T14:10:11Z

        let display_string = expr.to_string();
        assert_eq!(sql_string, "a NOT IN (a, b, NULL)");
-        assert_eq!(display_string, "a@0 NOT IN (SET) ([Literal { value: Utf8(\"a\"), field: Field { name: \"lit\", data_type: Utf8, nullable: false, dict_id: 0, dict_is_ordered: false, metadata: {} } }, Literal { value: Utf8(\"b\"), field: Field { name: \"lit\", data_type: Utf8, nullable: false, dict_id: 0, dict_is_ordered: false, metadata: {} } }, Literal { value: Utf8(NULL), field: Field { name: \"lit\", data_type: Utf8, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} } }])");
+        assert_eq!(display_string, "a@0 NOT IN (SET) ([a, b, NULL])");


I've been wondering about the difference in formatting of literals between logical and physical plans. Logical will show Utf8("a") while physical shows just a. Out of scope for what I'm doing here, but would you want the type information in physical as well? For string literals in particular I think quotes would be a useful addition to avoid any possible confusion with column or table names.

I'll make a separate PR for that since it will probably affect quite a lot of the sql logic test results.
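As a rough illustration of the quoting idea, here is a hedged sketch of a `Display` impl that wraps string literals in quotes so they cannot be mistaken for a column name. The `Operand` type and the choice of SQL-style single quotes (with embedded quotes doubled) are assumptions made for this example; the thread does not specify a quoting style, and this is not DataFusion's API.

```rust
use std::fmt;

// Hypothetical operand type, for illustration only.
enum Operand {
    // A column reference, shown as `name@index` like the plans in this thread.
    Column { name: String, index: usize },
    // A string literal.
    StrLit(String),
}

impl fmt::Display for Operand {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            Operand::Column { name, index } => write!(f, "{name}@{index}"),
            // Assumed style: single quotes, with embedded quotes doubled.
            Operand::StrLit(s) => write!(f, "'{}'", s.replace('\'', "''")),
        }
    }
}
```

Under these assumptions, a column `a` at index 0 compared with the literal `a` would render as `a@0 = 'a'`, which keeps the two visually distinct.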

alamb

Thank you @pepijnve

alamb · 2025-10-02T21:00:45Z

        let display_string = expr.to_string();
        assert_eq!(sql_string, "a NOT IN (a, b, NULL)");
-        assert_eq!(display_string, "a@0 NOT IN (SET) ([Literal { value: Utf8(\"a\"), field: Field { name: \"lit\", data_type: Utf8, nullable: false, dict_id: 0, dict_is_ordered: false, metadata: {} } }, Literal { value: Utf8(\"b\"), field: Field { name: \"lit\", data_type: Utf8, nullable: false, dict_id: 0, dict_is_ordered: false, metadata: {} } }, Literal { value: Utf8(NULL), field: Field { name: \"lit\", data_type: Utf8, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} } }])");
+        assert_eq!(display_string, "a@0 NOT IN (SET) ([a, b, NULL])");


alamb · 2025-10-04T10:52:09Z

Thanks again @pepijnve and @Jefffrey

* Remove spurious `Use` in InListExpr display formatted output
* Adapt tpch.slt expected results
* Reduce verbosity of Display for InListExpr output
* Silence clippy warning

(cherry picked from commit d273ffb)

* Remove spurious `Use` in InListExpr display formatted output
* Adapt tpch.slt expected results
* Reduce verbosity of Display for InListExpr output
* Silence clippy warning

(cherry picked from commit d273ffb)
(cherry picked from commit 050a110)

Remove spurious Use in InListExpr display formatted output

eaba977

github-actions Bot added the physical-expr Changes to the physical-expr crates label Oct 2, 2025

Jefffrey approved these changes Oct 2, 2025

Adapt tpch.slt expected results

5d56597

github-actions Bot added the sqllogictest SQL Logic Tests (.slt) label Oct 2, 2025

Reduce verbosity of Display for InListExpr output

20f197e

alamb reviewed Oct 2, 2025

Silence clippy warning

00336c4

pepijnve changed the title ~~Remove spurious Use in InListExpr display formatted output~~ Improve InListExpr plan display Oct 2, 2025

alamb approved these changes Oct 2, 2025

Jefffrey approved these changes Oct 3, 2025

alamb added this pull request to the merge queue Oct 4, 2025

Merged via the queue into apache:main with commit d273ffb Oct 4, 2025
29 checks passed

pepijnve deleted the in_set_explain branch November 3, 2025 16:22

alamb merged 4 commits into apache:main from pepijnve:in_set_explain
