Make columnarAM dependent on rel objects in columnar schema #5456

onurctirtir · 2021-11-10T14:43:51Z

DESCRIPTION: Fixes a bug that could break pg upgrades due to missing pg_depend records for columnar table access method

First commit enables storing pg_upgrade logs for new data dir in case of failure.
(Reverting third commit would reveal how it looks like)
Second commit adds a test that reproduces the bug on our regression test suite.
The only scenario that I could come up with was renaming the schema (public schema in our case) to something else, then creating the columnar table there.
(Reverting third commit would make this branch reproduce the bug on CI)
Third commit is the commit that fixes the bug:

During pg upgrades, we have seen that it is not guaranteed that a
columnar table will be created after metadata objects got created.
Prior to changes done in this commit, we had such a dependency
relationship in pg_depend:

columnar_table ----> columnarAM ----> citus extension
                                           ^  ^
                                           |  |
columnar.storage_id_seq --------------------  |
                                              |
columnar.stripe -------------------------------

Since pg_upgrade just knows to follow topological sort of the objects
when creating database dump, above dependency graph doesn't imply that
columnar_table should be created before metadata objects such as
columnar.storage_id_seq and columnar.stripe are created.

For this reason, with this commit we add new records to pg_depend to
make columnarAM depending on all rel objects living in columnar
schema. That way, pg_upgrade will know it needs to create those before
creating columnarAM, and similarly, before creating any tables using
columnarAM.

Note that in addition to inserting those records via installation script, we also
do the same in citus_finish_pg_upgrade(). This is because, pg_upgrade
rebuilds catalog tables in the new cluster and that means, we must insert
them in the new cluster too.

Given that we decided not to backport this fix to older versions, here is the workaround for 10.0 & 10.1:
#5456 (comment).

codecov · 2021-11-10T14:45:38Z

Codecov Report

Merging #5456 (debca26) into master (8c0bc94) will decrease coverage by 0.00%.
The diff coverage is n/a.

❗ Current head debca26 differs from pull request most recent head 2026166. Consider uploading reports for the commit 2026166 to get more accurate results

@@            Coverage Diff             @@
##           master    #5456      +/-   ##
==========================================
- Coverage   92.74%   92.74%   -0.01%     
==========================================
  Files         215      215              
  Lines       45321    45321              
==========================================
- Hits        42035    42031       -4     
- Misses       3286     3290       +4

onurctirtir · 2021-11-10T14:47:08Z

I wonder what @SaitTalhaNisanci would say for the first and second commits.
For the third one, it would be very nice if @thanodnl can review.

src/test/regress/citus_tests/upgrade/pg_upgrade_test.py

.circleci/config.yml

src/backend/columnar/sql/columnar--11.0-1--11.0-2.sql

src/backend/columnar/sql/downgrades/columnar--11.0-2--11.0-1.sql

src/backend/distributed/citus.control

src/test/regress/expected/multi_extension.out

thanodnl

Code looks good to me.

As commented on a thread, I would like to see one change.

Even though today having duplicate entries in pg_depend seems to not be an issue, I don't think postgres has codepaths where a duplicate entry can exists.

Lets treat entries in pg_depend as unique, and prevent them to be added twice.
This could either be done with an INSERT ON CONFLICT DO NOTHING assuming this works with the INSERT INTO SELECT clause.

Otherwise, lets filter the selected rows by what is already present in pg_depend and insert only the missing ones.

It seems that pg15 might get support for SQL Merge: https://www.postgresql.org/message-id/flat/20210108192200.GA25633%40alvherre.pgsql#a8636c22f696aa519c1cdbb824b70e51 which is a bit far out for this patch.

onurctirtir · 2021-11-15T12:29:24Z

Code looks good to me.

As commented on a thread, I would like to see one change.

Even though today having duplicate entries in pg_depend seems to not be an issue, I don't think postgres has codepaths where a duplicate entry can exists.

Lets treat entries in pg_depend as unique, and prevent them to be added twice. This could either be done with an INSERT ON CONFLICT DO NOTHING assuming this works with the INSERT INTO SELECT clause.

Otherwise, lets filter the selected rows by what is already present in pg_depend and insert only the missing ones.

It seems that pg15 might get support for SQL Merge: https://www.postgresql.org/message-id/flat/20210108192200.GA25633%40alvherre.pgsql#a8636c22f696aa519c1cdbb824b70e51 which is a bit far out for this patch.

I guess ON CONFLICT requires having a unique/exclusion constraint that covers the columns that we are interested in :
ERROR: there is no unique or exclusion constraint matching the ON CONFLICT specification

Even more, catalog tables don't seem to support ON CONFLICT anyway:
ERROR: ON CONFLICT is not supported with system catalog tables

So I will check if the entry already exists using a CTE etc.

onurctirtir · 2021-11-17T10:12:25Z

Given that we will only backport this patch to 10.2, below is the command that needs to be run on older versions (10.0 & 10.1) to manually fix this issue there. Note that the only difference is the commented line for stripe_first_row_number_idx.
Also note that below command should be run on all nodes, before calling citus_prepare_pg_upgrade():

INSERT INTO pg_depend
SELECT -- Define a dependency edge from "columnar table access method" ..
        'pg_am'::regclass::oid as classid,
        (select oid from pg_am where amname = 'columnar') as objid,
        0 as objsubid,
        -- ... to each object that is registered to pg_class and that lives
        -- in "columnar" schema. That contains catalog tables, indexes
        -- created on them and the sequences created in "columnar" schema.
        --
        -- Given the possibility of user might have created their own objects
        -- in columnar schema, we explicitly specify list of objects that we
        -- are interested in.
        'pg_class'::regclass::oid as refclassid,
        columnar_schema_members.relname::regclass::oid as refobjid,
        0 as refobjsubid,
        'n' as deptype
FROM (VALUES ('columnar.chunk'),
            ('columnar.chunk_group'),
            ('columnar.chunk_group_pkey'),
            ('columnar.chunk_pkey'),
            ('columnar.options'),
            ('columnar.options_pkey'),
            ('columnar.storageid_seq'),
            ('columnar.stripe'),
            -- ('columnar.stripe_first_row_number_idx'), -- ignore for 10.0 & 10.1, otherwise, would throw an error
            ('columnar.stripe_pkey')
     ) columnar_schema_members(relname)
-- Avoid inserting duplicate entries into pg_depend.
EXCEPT TABLE pg_depend;

…oesn't exist Fixes #6570. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because, pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that; in #5456, we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensuring the existency of a class of metadata objects --such as columnar.storageid_seq-- and helped fixing #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such columnar.stripe_pkey-- didn't help at all because they were indeed causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into the account. For this reason, instead of inserting such dependency edges from indexes to columnar-am, we allow columnar metadata accessors to fall-back to sequential scan during pg upgrades.

…5456 Fixes #5510. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because, pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that; in #5456, we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensuring the existency of a class of metadata objects --such as columnar.storageid_seq-- and helped fixing #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such columnar.stripe_pkey-- didn't help at all because they were indeed causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into the account. For this reason, this commit deletes those dependency edges so that pg_dump stops complaining about them. Note that it's not critical to delete those edges from pg_depend since they're not breaking pg upgrades but were triggering some warning messages. And given that backporting a sql change into older versions is hard a lot, we skip backporting this.

…oesn't exist Fixes #6570. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because, pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that; in #5456, we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensuring the existency of a class of metadata objects --such as columnar.storageid_seq-- and helped fixing #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such columnar.stripe_pkey-- didn't help at all because they were indeed causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into the account. For this reason, instead of inserting such dependency edges from indexes to columnar-am, we allow columnar metadata accessors to fall-back to sequential scan during pg upgrades.

…5456 Fixes #5510. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because, pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that; in #5456, we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensuring the existency of a class of metadata objects --such as columnar.storageid_seq-- and helped fixing #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such columnar.stripe_pkey-- didn't help at all because they were indeed causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into the account. For this reason, this commit deletes those dependency edges so that pg_dump stops complaining about them. Note that it's not critical to delete those edges from pg_depend since they're not breaking pg upgrades but were triggering some warning messages. And given that backporting a sql change into older versions is hard a lot, we skip backporting this.

…oesn't exist Fixes #6570. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because, pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that; in #5456, we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensuring the existency of a class of metadata objects --such as columnar.storageid_seq-- and helped fixing #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such columnar.stripe_pkey-- didn't help at all because they were indeed causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into the account. For this reason, instead of inserting such dependency edges from indexes to columnar-am, we allow columnar metadata accessors to fall-back to sequential scan during pg upgrades.

…oesn't exist Fixes #6570. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because, pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that; in #5456, we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensuring the existency of a class of metadata objects --such as columnar.storageid_seq-- and helped fixing #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such columnar.stripe_pkey-- didn't help at all because they were indeed causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into the account. For this reason, instead of inserting such dependency edges from indexes to columnar-am, we allow columnar metadata accessors to fall-back to sequential scan during pg upgrades. (cherry picked from commit 1c51dda)

…oesn't exist Fixes #6570. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because, pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that; in #5456, we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensuring the existency of a class of metadata objects --such as columnar.storageid_seq-- and helped fixing #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such columnar.stripe_pkey-- didn't help at all because they were indeed causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into the account. For this reason, instead of inserting such dependency edges from indexes to columnar-am, we allow columnar metadata accessors to fall-back to sequential scan during pg upgrades. (cherry picked from commit 1c51dda) (cherry picked from commit b9e1840)

In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because, pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that; in #5456, we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensuring the existency of a class of metadata objects --such as columnar.storageid_seq-- and helped fixing #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such columnar.stripe_pkey-- didn't help at all because they were indeed causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into the account. For this reason, this commit deletes those dependency edges so that pg_dump stops complaining about them. Note that it's not critical to delete those edges from pg_depend since they're not breaking pg upgrades but were triggering some warning messages. And given that backporting a sql change into older versions is hard a lot, we skip backporting this.

…m (inserted in #5456) (#6628) DESCRIPTION: Fixes (pg_dump/pg_upgrade) dependency loop warnings caused by pg_depend entries inserted by citus_columnar Fixes #5510. In the past, having columnar tables in the cluster was causing pg upgrades to fail when attempting to access columnar metadata. This is because, pg_dump doesn't see objects that we use for columnar-am related booking as the dependencies of the tables using columnar-am. To fix that; in #5456, we inserted some "normal dependency" edges (from those objects to columnar-am) into pg_depend. This helped us ensuring the existency of a class of metadata objects --such as columnar.storageid_seq-- and helped fixing #5437. However, the normal-dependency edges that we added for indexes on columnar metadata tables --such columnar.stripe_pkey-- didn't help at all because they were indeed causing dependency loops (#5510) and pg_dump was not able to take those dependency edges into the account. For this reason, this commit deletes those dependency edges so that pg_dump stops complaining about them. Note that it's not critical to delete those edges from pg_depend since they're not breaking pg upgrades but were triggering some warning messages. And given that backporting a sql change into older versions is hard a lot, we skip backporting this.

onurctirtir added backport columnar labels Nov 10, 2021

onurctirtir requested review from SaitTalhaNisanci and thanodnl November 10, 2021 14:45

onurctirtir added cherry-pick-10.0 cherry-pick-10.1 cherry-pick-10.2 and removed backport labels Nov 10, 2021

onurctirtir mentioned this pull request Nov 10, 2021

Skip deleting options if columnar.options is already dropped #5458

Merged

SaitTalhaNisanci reviewed Nov 10, 2021

View reviewed changes

src/test/regress/citus_tests/upgrade/pg_upgrade_test.py Outdated Show resolved Hide resolved

onurctirtir force-pushed the col/pg-upgrade-dependency branch 4 times, most recently from 2b8c6e7 to fd093ab Compare November 10, 2021 17:49

SaitTalhaNisanci reviewed Nov 11, 2021

View reviewed changes

.circleci/config.yml Outdated Show resolved Hide resolved

SaitTalhaNisanci reviewed Nov 11, 2021

View reviewed changes

src/test/regress/expected/multi_extension.out Show resolved Hide resolved

onurctirtir force-pushed the col/pg-upgrade-dependency branch 2 times, most recently from e4dad7b to 33c3865 Compare November 12, 2021 09:58

thanodnl reviewed Nov 15, 2021

View reviewed changes

onurctirtir requested a review from thanodnl November 15, 2021 14:26

thanodnl approved these changes Nov 15, 2021

View reviewed changes

onurctirtir force-pushed the col/pg-upgrade-dependency branch from debca26 to d72ad84 Compare November 15, 2021 15:18

onurctirtir removed cherry-pick-10.0 cherry-pick-10.1 labels Nov 17, 2021

onurctirtir requested a review from thanodnl November 17, 2021 11:57

onurctirtir mentioned this pull request Jan 18, 2023

Remove pg_depend entries from columnar metadata indexes to columnar-am (inserted in #5456) #6628

Merged

onurctirtir mentioned this pull request Aug 1, 2025

Not automatically create citus_columnar when creating citus extension #8081

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make columnarAM dependent on rel objects in columnar schema #5456

Make columnarAM dependent on rel objects in columnar schema #5456

Uh oh!

onurctirtir commented Nov 10, 2021 •

edited

Loading

Uh oh!

codecov bot commented Nov 10, 2021 •

edited

Loading

Uh oh!

onurctirtir commented Nov 10, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thanodnl left a comment

Uh oh!

onurctirtir commented Nov 15, 2021

Uh oh!

onurctirtir commented Nov 17, 2021 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Make columnarAM dependent on rel objects in columnar schema #5456

Make columnarAM dependent on rel objects in columnar schema #5456

Uh oh!

Conversation

onurctirtir commented Nov 10, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Nov 10, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

onurctirtir commented Nov 10, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thanodnl left a comment

Choose a reason for hiding this comment

Uh oh!

onurctirtir commented Nov 15, 2021

Uh oh!

onurctirtir commented Nov 17, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

onurctirtir commented Nov 10, 2021 •

edited

Loading

codecov bot commented Nov 10, 2021 •

edited

Loading

onurctirtir commented Nov 17, 2021 •

edited

Loading