Feat: Allow some control of table naming at the physical layer by erindru · Pull Request #4982 · SQLMesh/sqlmesh

erindru · 2025-07-17T05:44:09Z

Up until now, there has been no control over how SQLMesh generates table names for the physical layer. This presented some issues on engines like Postgres that by default have very short identifier length limits that get triggered unexpectedly because SQLMesh generates table names that contain some extra metadata alongside the model name.

This PR introduces a project-level property called physical_table_naming_convention that has the following values:

schema_and_table - include both the schema name and table name from the model in the snapshot table name (current behaviour; default)
table_only - include the table name only from the model in the snapshot table name with the assumption that the table is being created within a physical schema that already contains the schema name from the model
hash_md5 - produce snapshot table names based on a MD5 hash of what we would have called the table if we were using schema_and_table. This leads to predictable identifier lengths at the expense of readability

georgesittas · 2025-07-18T14:18:34Z

docs/guides/configuration.md

+warehouse.sqlmesh__finance_mart.transaction_events_over_threshold__<fingerprint>
+```
+
+Notice that the model schema name is no longer part of the physical table name. This allows for slightly longer model names on engines with low identifier length limits, which may be useful for your project.


Reading the docs right now doesn't make it obvious why the schema is included in the physical table's name by default. Should we include an example to explain the rationale behind that choice?

To be honest, I don't know what the rationale behind the original choice was.

The only thing I can think of is to allow someone to set:

physical_schema_mapping: '.*': some_schema

Which would map every model to the same physical schema regardless of the model schema. In this situation, including the model schema within the model's table name would be helpful to disambiguate foo.model_a and bar.model_a if they would both be written as some_schema.model_a at the physical layer

So I believe the default exists because it was the lowest common denominator that worked across any rendition of physical_schema_mapping.

I've updated the docs

georgesittas · 2025-07-18T14:58:10Z

Did a quick first pass– the implementation seems reasonable.

izeigerman · 2025-07-24T17:11:29Z

docs/reference/configuration.md

 | `environment_suffix_target`   | Whether SQLMesh views should append their environment name to the `schema` or `table` - [additional details](../guides/configuration.md#view-schema-override). (Default: `schema`)                                                                                                                 | string               | N        |
-| `gateway_managed_virtual_layer`   | Whether SQLMesh views of the virtual layer will be created by the default gateway or model specified gateways - [additional details](../guides/multi_engine.md#gateway-managed-virtual-layer). (Default: False)                                                                                                                 | boolean               | N        |
-| `infer_python_dependencies`   | Whether SQLMesh will statically analyze Python code to automatically infer Python package requirements. (Default: True)                                                                                                                 | boolean               | N        |
+| `physical_table_naming_convention`| Sets which parts of the model name are included in the physical table names. Options are `schema_and_table` or `table_only` - [additional details](../guides/configuration.md#physical-table-naming-convention). (Default: `schema_and_table`)                                                 | string               | N        |


Note: The section you're adding this to is called Environment. I don't think this is the right place for it, or we should rename the section.

I put it here because the other physical_* properties were here, but you're right this could use a refactor. I've:

renamed this section to Environments (Virtual Layer)

moved the project properties like log_limit to the Projects section

moved the model properties like time_column_format to the Models section

added a section called Database (Physical Layer) to house the physical_* properties

izeigerman · 2025-07-24T17:24:02Z

sqlmesh/core/config/root.py

    environment_suffix_target: EnvironmentSuffixTarget = Field(
        default=EnvironmentSuffixTarget.default
    )
+    physical_table_naming_convention: t.Optional[TableNamingConvention] = None


Why not use schema_and_table as a default?

izeigerman · 2025-07-24T17:25:39Z

sqlmesh/core/snapshot/definition.py

    # This can be removed from this model once Pydantic 1 support is dropped (must remain in `Snapshot` though)
    base_table_name_override: t.Optional[str]
    dev_table_suffix: str
+    table_naming_convention: t.Optional[TableNamingConvention] = None


Ditto: I believe we have an appropriate default for this

The nice thing about using None is that it makes the serialized JSON payloads smaller. But I dont have a strong opinion on this, so i've changed these to default to TableNamingConvention.default

izeigerman · 2025-07-24T17:27:58Z

sqlmesh/core/snapshot/definition.py

    base_table_name_override: t.Optional[str] = None
    next_auto_restatement_ts: t.Optional[int] = None
    dev_table_suffix: str = "dev"
+    table_naming_convention_: t.Optional[TableNamingConvention] = Field(


When reusing the physical version (eg. FORWARD_ONLY, METADATA) we should inherit the convention of the previous snapshot just like we do the physical schema: https://github.com/TobikoData/sqlmesh/blob/main/sqlmesh/core/snapshot/definition.py#L1015

Ahh, thanks. I had a suspicion something like this was lurking.

I've added a test to show that categorizing a snapshot as FORWARD_ONLY inherits the table_naming_convention from the previous_version

georgesittas reviewed Jul 18, 2025

View reviewed changes

erindru force-pushed the erin/physical-table-naming-convention branch from ce93b54 to 911bcfc Compare July 23, 2025 21:06

erindru marked this pull request as ready for review July 23, 2025 21:54

izeigerman reviewed Jul 24, 2025

View reviewed changes

erindru added 4 commits July 24, 2025 23:47

Feat: Allow some control of table naming at the physical layer

fee780f

docs feedback

c194506

Update docs

366f3fe

PR feedback

a5496e3

erindru force-pushed the erin/physical-table-naming-convention branch from 911bcfc to a5496e3 Compare July 25, 2025 01:15

izeigerman approved these changes Jul 25, 2025

View reviewed changes

erindru merged commit 9b04f45 into main Jul 27, 2025
27 checks passed

erindru deleted the erin/physical-table-naming-convention branch July 27, 2025 21:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: Allow some control of table naming at the physical layer#4982

Feat: Allow some control of table naming at the physical layer#4982
erindru merged 4 commits intomainfrom
erin/physical-table-naming-convention

erindru commented Jul 17, 2025 •

edited

Loading

Uh oh!

georgesittas Jul 18, 2025

Uh oh!

erindru Jul 21, 2025

Uh oh!

erindru Jul 23, 2025

Uh oh!

georgesittas commented Jul 18, 2025

Uh oh!

izeigerman Jul 24, 2025

Uh oh!

erindru Jul 25, 2025

Uh oh!

izeigerman Jul 24, 2025

Uh oh!

izeigerman Jul 24, 2025

Uh oh!

erindru Jul 25, 2025

Uh oh!

izeigerman Jul 24, 2025

Uh oh!

erindru Jul 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

erindru commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

georgesittas commented Jul 18, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

erindru commented Jul 17, 2025 •

edited

Loading