[Backport] Druid automated quickstart by findingrish · Pull Request #13551 · apache/druid

findingrish · 2022-12-12T07:37:03Z

Backports #13365

…anges in druid-quickstart.py

…tion changes

* Firehose migration doc * Update migrate-from-firehose-ingestion.md * Updated with review comments and suggestions * Update migrate-from-firehose-ingestion.md * Update migrate-from-firehose-ingestion.md * Update migrate-from-firehose-ingestion.md

…13402)

* Add sketch fetching framework * Refactor code to support sequential merge * Update worker sketch fetcher * Refactor sketch fetcher * Refactor sketch fetcher * Add context parameter and threshold to trigger sequential merge * Fix test * Add integration test for non sequential merge * Address review comments * Address review comments * Address review comments * Resolve maxRetainedBytes * Add new classes * Renamed key statistics information class * Rename fetchStatisticsSnapshotForTimeChunk function * Address review comments * Address review comments * Update documentation and add comments * Resolve build issues * Resolve build issues * Change worker APIs to async * Address review comments * Resolve build issues * Add null time check * Update integration tests * Address review comments * Add log messages and comments * Resolve build issues * Add unit tests * Add unit tests * Fix timing issue in tests

supervise script changes to process java opts array use argparse, leave free memory, logging

…stry. (apache#13403) * Attach IO error to parse error when we can't contact Avro schema registry. The change in apache#12080 lost the original exception context. This patch adds it back. * Add hamcrest-core. * Fix format string.

* Prepare master branch for next release, 26.0.0 * Use docker image for druid 24.0.1 * Fix version in druid-it-cases pom.xml

… the code

* Suppress jackson-databind CVE-2022-42003 and CVE-2022-42004 (cherry picked from commit 1f4d892) * Suppress CVEs (cherry picked from commit ed55baa) * Suppress vulnerabilities from druid-website package (cherry picked from commit c0fb364) * Add more suppressions for website package (cherry picked from commit 9bba569)

…verview.type=http (apache#13499) * fix issue with http server inventory view blocking data node http server shutdown with long polling * adjust * fix test inspections

* Processors for Window Processing This is an initial take on how to use Processors for Window Processing. A Processor is an interface that transforms RowsAndColumns objects. RowsAndColumns objects are essentially combinations of rows and columns. The intention is that these Processors are the start of a set of operators that more closely resemble what DB engineers would be accustomed to seeing. * Wire up windowed processors with a query type that can run them end-to-end. This code can be used to actually run a query, so yay! * Wire up windowed processors with a query type that can run them end-to-end. This code can be used to actually run a query, so yay! * Some SQL tests for window functions. Added wikipedia data to the indexes available to the SQL queries and tests validating the windowing functionality as it exists now. Co-authored-by: Gian Merlino <gianmerlino@gmail.com>

…ast/slow partition time metrics (apache#13420)

…pression (apache#13508)

Changes: - Limit max batch size in `SegmentAllocationQueue` to 500 - Rename `batchAllocationMaxWaitTime` to `batchAllocationWaitTime` since the actual wait time may exceed this configured value. - Replace usage of `SegmentInsertAction` in `TaskToolbox` with `SegmentTransactionalInsertAction`

…3486) * add padding and keywords * add arrayOfDoubles * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/development/extensions-core/datasketches-tuple.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * partiton int * fix docs Co-authored-by: Charles Smith <techdocsmith@gmail.com>

* Update to native ingestion doc * Update docs/ingestion/native-batch.md Co-authored-by: Kashif Faraz <kashif.faraz@gmail.com> * Update native-batch.md Co-authored-by: Kashif Faraz <kashif.faraz@gmail.com>

* Remove stray reference to fix OOM while merging sketches * Update future to add result from executor service * Update tests and address review comments * Address review comments * Moved mock * Close threadpool on teardown * Remove worker task cancel

* improve compaction status display * even more accurate * fix snapshot

…e#13525) 1) Edited the TooManyBuckets error message to mention PARTITIONED BY instead of segmentGranularity. 2) Added error-code-specific anchors in the docs. 3) Add information to various error codes in the docs about common causes and solutions.

* Enhanced MSQ table functions * HTTP, LOCALFILES and INLINE table functions powered by catalog metadata. * Documentation

…ache#13537) The planner sets sqlInsertSegmentGranularity in its context when using PARTITIONED BY, which sets it on every native query in the stack (as all native queries for a SQL query typically have the same context). QueryKit would interpret that as a request to configure bucketing for all native queries. This isn't useful, as bucketing is only used for the penultimate stage in INSERT / REPLACE. So, this patch modifies QueryKit to only look at sqlInsertSegmentGranularity on the outermost query. As an additional change, this patch switches the static ObjectMapper to use the processwide ObjectMapper for deserializing Granularities. Saves an ObjectMapper instance, and ensures that if there are any special serdes registered for Granularity, we'll pick them up.

rishabh singh and others added 30 commits November 15, 2022 11:08

Druid automated quickstart

8c9dcc2

remove conf/druid/single-server/quickstart/_common/historical/jvm.config

342e5d7

Minor changes in python script

6a4a48e

Add lower bound memory for some services

36b86b6

Additional runtime properties for services

dbfb465

Update supervise script to accept command arguments, corresponding ch…

7324252

…anges in druid-quickstart.py

File end newline

72494e9

Limit the ability to start multiple instances of a service, documenta…

e350587

…tion changes

simplify script arguments

68a1e66

restore changes in medium profile

e570606

run-druid refactor

b413d15

Firehose migration doc (apache#12981)

68018a8

* Firehose migration doc * Update migrate-from-firehose-ingestion.md * Updated with review comments and suggestions * Update migrate-from-firehose-ingestion.md * Update migrate-from-firehose-ingestion.md * Update migrate-from-firehose-ingestion.md

add ability to make inputFormat part of the example datasets (apache#…

fe34ecc

…13402)

compute and pass middle manager runtime properties to run-druid

9b8cdc6

supervise script changes to process java opts array use argparse, leave free memory, logging

Remove extra quotes from mm task javaopts array

75d169f

Update logic to compute minimum memory

a891319

simplify run-druid

5692753

remove debug options from run-druid

d111012

Prepare master branch for next release, 26.0.0 (apache#13401)

7cf761c

* Prepare master branch for next release, 26.0.0 * Use docker image for druid 24.0.1 * Fix version in druid-it-cases pom.xml

resolve the config_path provided

a9b24e2

comment out service specific runtime properties which are computed in…

71bfca1

… the code

simplify run-druid

032bae0

fix off by one error in nested column range index (apache#13405)

be4914d

clean up docs, naming changes

f55c749

Throw ValueError exception on illegal state

2a0dd34

update docs

c35cc1e

rename args, compute_only -> compute, run_zk -> zk

891eaaa

clintropolis and others added 29 commits December 6, 2022 15:52

fix issue with jetty graceful shutdown of data servers when druid.ser…

cf47216

…verview.type=http (apache#13499) * fix issue with http server inventory view blocking data node http server shutdown with long polling * adjust * fix test inspections

fix bug with broker parallel merge metrics emitting, add wall time, f…

37d8833

…ast/slow partition time metrics (apache#13420)

Better error message when theta_sketch_intersect is used on scalar ex…

b25cf21

…pression (apache#13508)

merge from master

6062c33

Update to native ingestion doc (apache#13482)

b56855b

* Update to native ingestion doc * Update docs/ingestion/native-batch.md Co-authored-by: Kashif Faraz <kashif.faraz@gmail.com> * Update native-batch.md Co-authored-by: Kashif Faraz <kashif.faraz@gmail.com>

Fix typo in metric name (apache#13521)

6995127

update start-druid

d5d81f4

Update python.md

5942eff

Update single-server.md

a4d14f7

Web console: improve compaction status display (apache#13523)

d85fb8c

* improve compaction status display * even more accurate * fix snapshot

merge from master

0742c77

Update python.md

41ad0bd

run python3 --version to check if python is installed

08507b4

update error anchors (apache#13527)

d8e27ea

Enhanced MSQ table functions (apache#13360)

013a12e

* Enhanced MSQ table functions * HTTP, LOCALFILES and INLINE table functions powered by catalog metadata. * Documentation

merge from master

48ffe3f

Update supervise script

14b8679

start-druid: echo message if python not found

2837dd3

update anchor text

511583e

minor change

d569693

Update condition in supervise script

d0e7374

JVM not jvm in docs

e3d3657

merge from master

4360f2f

findingrish closed this Dec 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Backport] Druid automated quickstart#13551

[Backport] Druid automated quickstart#13551
findingrish wants to merge 148 commits intoapache:25.0.0from
findingrish:backport_druid_quickstart

findingrish commented Dec 12, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Conversation

findingrish commented Dec 12, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants