
Avoid write failures if metrics mode is invalid#301

Merged
rdblue merged 5 commits into apache:master from rdblue:fix-write-failure on Aug 1, 2019
Conversation

rdblue (Contributor) commented Jul 19, 2019

This updates the MetricsConfig class to catch exceptions thrown by MetricsModes.fromString. The intent is to avoid failing write jobs when a metrics mode is invalid, because users may change a table's properties while a pipeline that writes to it is deployed and running. A live pipeline should not fail because of a typo in table tuning settings.

rdblue (Contributor, Author) commented Jul 19, 2019

@aokolnychyi, could you review this?

aokolnychyi (Contributor) left a comment

I think it is a great idea not to fail jobs if the metrics config is invalid. I would also handle invalid default modes and add a test (maybe to TestMetricsModes).

public static MetricsConfig fromProperties(Map<String, String> props) {
  MetricsConfig spec = new MetricsConfig();
  String defaultModeAsString = props.getOrDefault(DEFAULT_WRITE_METRICS_MODE, DEFAULT_WRITE_METRICS_MODE_DEFAULT);
  spec.defaultMode = MetricsModes.fromString(defaultModeAsString);
Contributor
Will we fail jobs if the default mode is invalid?
Would it make sense to fall back to DEFAULT_WRITE_METRICS_MODE_DEFAULT?

rdblue (Contributor, Author)
Good idea, we should wrap that in a try/catch as well.
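The fallback behavior agreed on here could be sketched roughly as follows. This is an illustrative standalone example, not the actual Iceberg MetricsConfig/MetricsModes API: the class name, enum values, and the fallback mode are placeholders.

```java
import java.util.Locale;
import java.util.Map;

public class MetricsModeFallback {
  enum Mode { NONE, COUNTS, TRUNCATE, FULL }

  // Hypothetical stand-in for MetricsModes.fromString: throws on an
  // unrecognized mode string, e.g. a typo in a table property.
  static Mode fromString(String value) {
    String mode = value.toLowerCase(Locale.ROOT).trim();
    if (mode.startsWith("truncate")) {
      return Mode.TRUNCATE;
    }
    switch (mode) {
      case "none":   return Mode.NONE;
      case "counts": return Mode.COUNTS;
      case "full":   return Mode.FULL;
      default:
        throw new IllegalArgumentException("Invalid metrics mode: " + value);
    }
  }

  // Parse defensively: an invalid table property falls back to a sane
  // default instead of failing a running write job.
  static Mode fromStringOrDefault(String value, Mode fallback) {
    try {
      return fromString(value);
    } catch (IllegalArgumentException e) {
      return fallback; // a real implementation would also log a warning here
    }
  }

  public static void main(String[] args) {
    // "fulll" is a typo; the write proceeds with the fallback mode.
    Map<String, String> props = Map.of("write.metadata.metrics.default", "fulll");
    Mode mode = fromStringOrDefault(
        props.getOrDefault("write.metadata.metrics.default", "truncate(16)"),
        Mode.TRUNCATE);
    System.out.println(mode);
  }
}
```

The same try/catch applies to both the table-wide default mode and the per-column overrides, so one bad property never aborts the job.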

rdblue (Contributor, Author)
Fixed.

rdblue (Contributor, Author) commented Aug 1, 2019

@aokolnychyi, could you take another look?

public static final String METADATA_COMPRESSION = "write.metadata.compression-codec";
public static final String METADATA_COMPRESSION_DEFAULT = "none";

public static final String METRICS_MODE_COLUMN_CONF_PREFIX = "write.metadata.metrics.column.";
Contributor
+1 on this. Do we want to name it WRITE_METRICS_MODE_COLUMN_CONF_PREFIX to be consistent with the defaults? Is there a possibility we will have READ_METRICS_MODE_COLUMN_CONF_PREFIX? Not sure.

rdblue (Contributor, Author)
I think this is fine.

aokolnychyi (Contributor) left a comment
LGTM, thanks!

@rdblue rdblue merged commit 62d09d7 into apache:master Aug 1, 2019
danielcweeks pushed a commit that referenced this pull request Aug 1, 2019
* Add argument validation to HadoopTables#create (#298)

* Install source JAR when running install target (#310)

* Add projectStrict for Dates and Timestamps (#283)

* Correctly publish artifacts on JitPack (#321)

The Gradle install target produces invalid POM files that are missing
the dependencyManagement section and versions for some dependencies.
Instead, we directly tell JitPack to run the correct Gradle target.

* Add build info to README.md (#304)

* Convert Iceberg time type to Hive string type (#325)

* Add overwrite option to write builders (#318)

* Fix out of order Pig partition fields (#326)

* Add mapping to Iceberg for external name-based schemas (#338)

* Site: Fix broken link to Iceberg API (#333)

* Add forTable method for Avro WriteBuilder (#322)

* Remove multiple literal strings check rule for scala (#335)

* Fix invalid javadoc url in README.md (#336)

* Use UnicodeUtil.truncateString for Truncate transform. (#340)

This truncates by unicode codepoint instead of Java chars.

* Refactor metrics tests for reuse (#331)

* Spark: Add support for write-audit-publish workflows (#342)

* Avoid write failures if metrics mode is invalid (#301)

* Fix truncateStringMax in UnicodeUtil (#334)

Fixes #328, fixes #329.

Index to codePointAt should be the offset calculated by code points

* [Vectorization] Added batch sizing, switched to BufferAllocator, other minor style fixes.