Skip to content

ITIndexerTest: validate new data source after reindex#7171

Merged
jihoonson merged 1 commit intoapache:masterfrom
apollographql:glasser/itindexertest-validate
Mar 4, 2019
Merged

ITIndexerTest: validate new data source after reindex#7171
jihoonson merged 1 commit intoapache:masterfrom
apollographql:glasser/itindexertest-validate

Conversation

@glasser
Copy link
Copy Markdown
Contributor

@glasser glasser commented Mar 1, 2019

Previously, the test validated that the data source that we ingested from still
had the same query responses that it did before the second ingestion. This is
less useful than validating queries against the newly created data source.

The new queries file differs from the old one in that its maxTime is earlier due
to the interval selected by the reindex, and in that it does not query for the
dropped metric "count".

Previously, the test validated that the data source that we ingested from still
had the same query responses that it did before the second ingestion. This is
less useful than validating queries against the newly created data source.

The new queries file differs from the old one in that its maxTime is earlier due
to the interval selected by the reindex, and in that it does not query for the
dropped metric "count".
Copy link
Copy Markdown
Member

@clintropolis clintropolis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM 👍

@glasser
Copy link
Copy Markdown
Contributor Author

glasser commented Mar 2, 2019

(The errors highlighted by TeamCity do not appear to be in files touched by this PR.)

@@ -0,0 +1,66 @@
[
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess we don't need this new query? It's same with wikipedia_index_queries.json except that rows is missing in this query.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The differences are:

  • maxTime in the first query is different (because the second ingest pulls a subset of time)
  • not asking for the now-gone count metric in the second query (is that what you mean by rows?)

Are you proposing just not doing queries at all after the reingest? It seems reasonable to validate that some data ended up in the new datasource.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh I see. I missed the 1st change. It looks good to me. Thanks.

@jihoonson jihoonson merged commit 7bf1ee4 into apache:master Mar 4, 2019
@jihoonson jihoonson added this to the 0.15.0 milestone May 16, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants