Skip to content

Abstractify hadoopy indexer configuration.#1428

Merged
fjy merged 1 commit intoapache:masterfrom
metamx:abstractifyHadoopConfigStuff
Jun 8, 2015
Merged

Abstractify hadoopy indexer configuration.#1428
fjy merged 1 commit intoapache:masterfrom
metamx:abstractifyHadoopConfigStuff

Conversation

@drcrallen
Copy link
Copy Markdown
Contributor

  • Moves many items to JobHelper
  • Remove dependencies of these functions on HadoopDruidIndexerConfig in favor of more general items
  • Changes functionalities of some of the path methods to always return a path with scheme
  • Adds retry to uploads
  • Change output loadSpec determining from using outputFS.getClass().getName() to using outputFS.getScheme()

@drcrallen drcrallen added this to the 0.8.0 milestone Jun 6, 2015
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

log error?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was torn on this one because if it is retrying that should perhaps just be a warning. But if something did go wrong here, I probably would want it to show up in ERROR, so I'll change it

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

@fjy
Copy link
Copy Markdown
Contributor

fjy commented Jun 8, 2015

It appears most of the changes are just moving methods to JobHelper and using RetryProxy versus the loops we had written before.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we define these as constants?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

They were not previously, but I can

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

@drcrallen
Copy link
Copy Markdown
Contributor Author

@fjy Yes, this is a migration rather than a major logic change. The migration is needed for upstream PRs though.

@drcrallen drcrallen force-pushed the abstractifyHadoopConfigStuff branch from b2b79a0 to 3e0181f Compare June 8, 2015 17:49
@drcrallen
Copy link
Copy Markdown
Contributor Author

There is one change I forgot to mention in the notes:

The output loadSpec builder changed from being class.getName() based to outputFS.getScheme() based.

* Moves many items to JobHelper
* Remove dependencies of these functions on HadoopDruidIndexerConfig in favor of more general items
* Changes functionalities of some of the path methods to always return a path with scheme
* Adds retry to uploads
* Change output loadSpec determining from using outputFS.getClass().getName() to using outputFS.getScheme()
@drcrallen drcrallen force-pushed the abstractifyHadoopConfigStuff branch from 3e0181f to 2a76bdc Compare June 8, 2015 17:53
@fjy
Copy link
Copy Markdown
Contributor

fjy commented Jun 8, 2015

👍

@nishantmonu51
Copy link
Copy Markdown
Member

LGTM, 👍

fjy added a commit that referenced this pull request Jun 8, 2015
Abstractify hadoopy indexer configuration.
@fjy fjy merged commit 6ae4ecc into apache:master Jun 8, 2015
@drcrallen drcrallen deleted the abstractifyHadoopConfigStuff branch June 8, 2015 18:27
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we explain why those "./" are necessary? They seem unnecessary, but apparently they are not?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

They are required due to Path's interpretation of "something:otherthing" as an absolute instead of relative string, so Path(somePath, "something:otherthing") will cause problems due to the later argument being an 'absolute' path instead of a relative one. But Path(somePath, "./something:otherthing") is fine

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants