Skip to content

Conversation

@mobuchowski
Copy link
Contributor

Currently, we call Snowflake (and other DBs) to get schemas of tables used in particular queries.

However, Snowflake connector management is buggy and leaves us in a hanging, deadlocked state sometimes, which causes OpenLineage collection to fail and .

This PR prevents that by never calling Snowflake during OpenLineage method execution, and relying purely on information received from SQL parsing and Airflow Connection.

The negative consequence of that PR is lack of schema information in OpenLineage events, but this is a good tradeoff - most important feature of OpenLineage should be not affecting running tasks.

@RNHTTR
Copy link
Contributor

RNHTTR commented Apr 18, 2024

tests are failing ☹️

Signed-off-by: Maciej Obuchowski <obuchowski.maciej@gmail.com>
@mobuchowski mobuchowski force-pushed the snowflake-openlineage-dontuseexternalconnection branch from e94fb08 to 3b26f58 Compare April 18, 2024 15:53
@mobuchowski mobuchowski requested a review from o-nikolas as a code owner April 18, 2024 15:53
@mobuchowski
Copy link
Contributor Author

@RNHTTR not anymore :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants