
Conversation

@JoshRosen
Contributor

This patch modifies JDBCWrapper.schemaString to wrap column names in quotes, which is necessary to allow creating tables with columns whose names are reserved words or contain spaces. This fixes #80.

Note that, by itself, this patch does not enable full support for creating Redshift tables with column names that contain spaces; we are currently constrained by Avro's schema validation rules (see #84).
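To make the change concrete, here is a minimal standalone sketch of the idea (not the actual JDBCWrapper code; names and types are illustrative): each column name in the generated schema string is wrapped in double quotes, with embedded quotes doubled, so reserved words and names with spaces survive the CREATE TABLE statement.

```scala
// Minimal sketch of quoting column names when building a CREATE TABLE
// schema string. Not the actual JDBCWrapper implementation; the helper
// names and field types below are illustrative.
object SchemaStringSketch {
  // Redshift delimits identifiers with double quotes; an embedded
  // double quote inside the name must be doubled.
  def quoteIdentifier(name: String): String =
    "\"" + name.replace("\"", "\"\"") + "\""

  // Render (name, sqlType) pairs as a comma-separated column list.
  def schemaString(fields: Seq[(String, String)]): String =
    fields
      .map { case (name, sqlType) => s"${quoteIdentifier(name)} $sqlType" }
      .mkString(", ")

  def main(args: Array[String]): Unit = {
    // "order" is a reserved word; "first name" contains a space.
    println(schemaString(Seq("order" -> "INTEGER", "first name" -> "TEXT")))
  }
}
```

Without the quoting step, Redshift would reject both example columns at CREATE TABLE time.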

@JoshRosen JoshRosen added the bug label Sep 12, 2015
@JoshRosen JoshRosen added this to the 0.5.1 milestone Sep 12, 2015
@JoshRosen
Contributor Author

@marmbrus, do you think that we should perform similar quoting in Spark SQL's built-in JDBC datasource? Is this type of quoting a dialect-specific thing? These questions aren't blockers to making this change here in spark-redshift, but I just wanted to briefly consider those questions to make sure that we're not overlooking potential bugs in Spark.

@codecov-io

Current coverage is 94.59%

Merging #85 into master will not affect coverage as of 9f763ea

@@            master     #85   diff @@
======================================
  Files           11      11       
  Stmts          444     444       
  Branches       105     105       
  Methods          0       0       
======================================
  Hit            420     420       
  Partial          0       0       
  Missed          24      24       

Review entire Coverage Diff as of 9f763ea


@marmbrus
Contributor

Yeah, this is a known issue in Spark (SPARK-9505) as well. However, there I think we will have to work it into the dialects, since different systems use different quoting mechanisms (at least Spark SQL's differs from MySQL's).

Do we need to do the same thing when querying such columns? Or do we already escape there?
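For context on why this can't be a single global rule, here is an illustrative sketch (the system names and mappings are just common conventions, not code from Spark or this project) of how identifier quoting varies across databases, which is the reason it belongs in per-database dialects:

```scala
// Illustrative only: identifier quoting differs by system, so a generic
// JDBC layer cannot hard-code one style. Mappings reflect common
// conventions, not any particular library's implementation.
object QuotingSketch {
  def quote(system: String, name: String): String = system match {
    case "mysql"                 => s"`$name`"      // backticks
    case "redshift" | "postgres" => "\"" + name + "\"" // SQL-standard double quotes
    case "sqlserver"             => s"[$name]"      // T-SQL brackets
    case _                       => name            // unknown: leave unquoted
  }
}
```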

@rxin
Contributor

rxin commented Sep 13, 2015

The JDBC dialect dev API already has a quoting mechanism defined.
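The shape of that mechanism can be sketched as follows. This is a standalone imitation of Spark's JdbcDialect developer API (the real class lives in org.apache.spark.sql.jdbc; the class and object names here are hypothetical stand-ins): a base dialect supplies a default quoting rule, and database-specific dialects override it.

```scala
// Standalone sketch mirroring the shape of Spark's JdbcDialect developer
// API; not the actual Spark classes. A base dialect provides a default
// quoting rule that specific dialects override.
abstract class JdbcDialectSketch {
  // Default: SQL-standard double quoting.
  def quoteIdentifier(colName: String): String = s""""$colName""""
}

object DefaultDialectSketch extends JdbcDialectSketch

object MySQLDialectSketch extends JdbcDialectSketch {
  // MySQL quotes identifiers with backticks instead.
  override def quoteIdentifier(colName: String): String = s"`$colName`"
}
```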


@JoshRosen
Contributor Author

@marmbrus, we already wrap column names in quotes when querying; it looks like we were just missing support for this when creating tables.

@JoshRosen
Contributor Author

Going to merge this now.

@JoshRosen
Contributor Author

Added an unload to this test, just to make it clear that the read path is also covered.

@JoshRosen JoshRosen closed this in 4dcf6e9 Sep 14, 2015
@JoshRosen JoshRosen deleted the column-name-escaping branch September 14, 2015 19:00

Successfully merging this pull request may close these issues.

Reserved words cannot be used as column names when writing back to Redshift
