[SPARK-16663][SQL] desc table should be consistent between data source and hive serde tables #14302

cloud-fan · 2016-07-21T13:15:49Z

What changes were proposed in this pull request?

Currently there are 2 inconsistence:

for data source table, we only print partition names, for hive table, we also print partition schema. After this PR, we will always print schema
if column doesn't have comment, data source table will print empty string, hive table will print null. After this PR, we will always print null

How was this patch tested?

new test in HiveDDLSuite

cloud-fan · 2016-07-21T13:16:10Z

cc @yhuai @liancheng

SparkQA · 2016-07-21T14:39:31Z

Test build #62679 has finished for PR 14302 at commit 090b109.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-07-22T02:49:04Z

Test build #62699 has finished for PR 14302 at commit 6726676.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

…bles

SparkQA · 2016-07-22T05:40:28Z

Test build #62709 has finished for PR 14302 at commit 1ffa49d.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

jaceklaskowski · 2016-07-22T14:18:01Z

sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala

-        append(buffer, s"# ${output.head.name}", "", "")
-        partCols.foreach(col => append(buffer, col, "", ""))
+        append(buffer, s"# ${output.head.name}", output(1).name, output(2).name)
+        val partCols = partColNames.map(n => schema.get.find(_.name == n).get)


What do you think about this?

val s = schema.get s.fieldNames.intersect(partColNames).map(s.apply)

I'd rewrite the whole if as following:

for (s <- schema if partColNames.nonEmpty) { append(buffer, "# Partition Information", "", "") append(buffer, s"# ${output.head.name}", output(1).name, output(2).name) describeSchema(StructType(partColNames.map(s(_)))) }

SparkQA · 2016-07-26T03:41:37Z

Test build #62859 has finished for PR 14302 at commit 5a5ddba.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-07-26T06:09:39Z

Test build #62865 has finished for PR 14302 at commit 56338b4.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

liancheng · 2016-07-26T10:44:30Z

LGTM, merging to master.

gatorsmile · 2016-09-03T03:50:49Z

@cloud-fan To backport #14531, we need to backport this. Do you want me to do it?

…e and hive serde tables Currently there are 2 inconsistence: 1. for data source table, we only print partition names, for hive table, we also print partition schema. After this PR, we will always print schema 2. if column doesn't have comment, data source table will print empty string, hive table will print null. After this PR, we will always print null new test in `HiveDDLSuite` Author: Wenchen Fan <wenchen@databricks.com> Closes #14302 from cloud-fan/minor3. (cherry picked from commit a2abb58) Signed-off-by: Wenchen Fan <wenchen@databricks.com>

cloud-fan · 2016-09-03T16:16:48Z

backported to 2.0

cloud-fan force-pushed the minor3 branch from 090b109 to 6726676 Compare July 22, 2016 01:28

cloud-fan force-pushed the minor3 branch from 6726676 to 1ffa49d Compare July 22, 2016 03:48

desc table should be consistent between data source and hive serde ta…

1ffa49d

…bles

jaceklaskowski reviewed Jul 22, 2016
View reviewed changes

address comments

5a5ddba

fix hive

56338b4

asfgit closed this in a2abb58 Jul 26, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-16663][SQL] desc table should be consistent between data source and hive serde tables #14302

[SPARK-16663][SQL] desc table should be consistent between data source and hive serde tables #14302

Uh oh!

cloud-fan commented Jul 21, 2016 •

edited

Loading

Uh oh!

cloud-fan commented Jul 21, 2016

Uh oh!

SparkQA commented Jul 21, 2016

Uh oh!

SparkQA commented Jul 22, 2016

Uh oh!

SparkQA commented Jul 22, 2016

Uh oh!

jaceklaskowski Jul 22, 2016

Uh oh!

liancheng Jul 25, 2016

Uh oh!

SparkQA commented Jul 26, 2016

Uh oh!

SparkQA commented Jul 26, 2016

Uh oh!

liancheng commented Jul 26, 2016

Uh oh!

gatorsmile commented Sep 3, 2016

Uh oh!

cloud-fan commented Sep 3, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

[SPARK-16663][SQL] desc table should be consistent between data source and hive serde tables #14302

[SPARK-16663][SQL] desc table should be consistent between data source and hive serde tables #14302

Uh oh!

Conversation

cloud-fan commented Jul 21, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

cloud-fan commented Jul 21, 2016

Uh oh!

SparkQA commented Jul 21, 2016

Uh oh!

SparkQA commented Jul 22, 2016

Uh oh!

SparkQA commented Jul 22, 2016

Uh oh!

jaceklaskowski Jul 22, 2016

Choose a reason for hiding this comment

Uh oh!

liancheng Jul 25, 2016

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Jul 26, 2016

Uh oh!

SparkQA commented Jul 26, 2016

Uh oh!

liancheng commented Jul 26, 2016

Uh oh!

gatorsmile commented Sep 3, 2016

Uh oh!

cloud-fan commented Sep 3, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

cloud-fan commented Jul 21, 2016 •

edited

Loading