Currently, the twitterstream and wikipedia_editstream datasources, used by ITTwitterQueryTest and ITWikipediaQueryTest respectively, exist only as pre-ingested segments in AWS S3.
When adding or modifying integration test queries to be run against those datasources, it would be convenient to have access to the event data used to generate those segments.
Perhaps the raw data could be uploaded to S3 as well, with a link to it added to the integration test README.