Skip to content

Conversation

@moiseenkov
Copy link
Contributor

Fixed batching for the following operators:

  • BigQueryToPostgresOperator
  • BigQueryToMsSqlOperator
  • BigQueryToMySqlOperator

Previously, if users specify the batch_size parameter it is used only for reading from the BigQuery, however, the writing to the database is performed with the default batch size of 1000 rows. This PR fixes that and the specified batch_size now used for both reading from BigQuery and writing to a database.

Additionally, updated the system test.

@boring-cyborg boring-cyborg bot added area:providers area:system-tests provider:google Google (including GCP) related issues labels Apr 24, 2024
@moiseenkov moiseenkov force-pushed the bigquery_to_postgres_fix_batch branch 3 times, most recently from b7ea281 to f12a935 Compare April 26, 2024 10:34
@moiseenkov moiseenkov force-pushed the bigquery_to_postgres_fix_batch branch from f12a935 to 9017c15 Compare April 26, 2024 14:58
@prithvi-git
Copy link

Performance is still very poor. Opened new ticket #40948

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area:providers area:system-tests provider:google Google (including GCP) related issues

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants