Druid 31.0.0 release notes by writer-jill · Pull Request #17092 · apache/druid

writer-jill · 2024-09-17T17:26:48Z

Release and upgrade notes for Druid 31.0.0

This PR has:

been self-reviewed.

abhishekagarwal87 · 2024-10-01T08:14:08Z


 This section contains important information about new and existing features.

+### Compaction on MSQ


Let's rename this headline to Compaction Features and then list:

- Compaction scheduler with greater flexibility and control over when and what to compact
- MSQ-based compaction for performant compaction jobs
- Concurrent compaction is now GA

No need to list all the nitty-gritty details as you have done right now. They can move to a different section or to the docs.

updated - asked @317brian to add the detail to the compaction docs.

Co-authored-by: Kashif Faraz <kashif.faraz@gmail.com>

Co-authored-by: 317brian <53799971+317brian@users.noreply.github.com>

adarshsanjeev · 2024-10-08T04:49:35Z

+- Fixed an issue with `ScanQueryFrameProcessor` cursor build not adjusting intervals [#17168](https://github.com/apache/druid/pull/17168)
+- Improved worker cancellation for the MSQ task engine to prevent race conditions [#17046](https://github.com/apache/druid/pull/17046)
+- Improved memory management to better support multi-threaded workers [#17057](https://github.com/apache/druid/pull/17057)
+- Reduced memory usage when transferring sketches between the MSQ task engine controller and worker [#16269](https://github.com/apache/druid/pull/16269)


This is duplicated from line 245. Also, a better way to word it might be "Add new format for serialization of sketches between MSQ controller and worker to reduce memory usage".

Akshat-Jain · 2024-10-08T05:33:28Z

+- Improved memory management to better support multi-threaded workers [#17057](https://github.com/apache/druid/pull/17057)
+- Reduced memory usage when transferring sketches between the MSQ task engine controller and worker [#16269](https://github.com/apache/druid/pull/16269)
+- Improved error handling when retrieving Avro schemas from registry [#16684](https://github.com/apache/druid/pull/16684)
+- Fixed issues related to partitioning boundaries in the MSQ task engine's window functions [#16729](https://github.com/apache/druid/pull/16729)


This is duplicated from line 247.

Also, it's a nit but a better message might be: Fixed issues related to partitioning boundaries for window functions in the MSQ task engine

Akshat-Jain · 2024-10-08T05:37:07Z

-##### Other streaming ingestion improvements
+[#16358](https://github.com/apache/druid/pull/16358)
+
+#### Other SQL-based ingestion improvements


Is this file expected to contain all PRs marked with milestone 31.0.0? For example, I don't see #16804 mentioned, is that expected?

Hey Akshat, we typically don't include bug fixes unless there's a specific reason to. It's just new features/improvements. There are currently some fixes in there that I'll remove as part of the final cleanup.

It looks like 16804 and 17141 didn't have the bug label applied. Was that intentional?

I see, thanks for the info!

> It looks like 16804 and 17141 didn't have the bug label applied. Was that intentional?

Nope. I don't have access to update PR labels, but yes, both of those PRs are bug fixes.

Akshat-Jain · 2024-10-08T05:44:38Z

+- Reduced memory usage when transferring sketches between the MSQ task engine controller and worker [#16269](https://github.com/apache/druid/pull/16269)
+- Improved error handling when retrieving Avro schemas from registry [#16684](https://github.com/apache/druid/pull/16684)
+- Fixed issues related to partitioning boundaries in the MSQ task engine's window functions [#16729](https://github.com/apache/druid/pull/16729)
+- Fixed a boost column issue causing quantile sketches to incorrectly estimate the number of output partitions to create [#17141](https://github.com/apache/druid/pull/17141)


Nit: This is MSQ window function specific, so we can maybe add that to the message: Fixed a boost column issue causing quantile sketches to incorrectly estimate the number of output partitions to create for window functions in MSQ task engine

Also, I see this PR also mentioned in the Other querying improvements section - is that expected?

Nope, it should not be duplicated. Will remove

LakshSingla

#16887 is not added in the release notes. A line item somewhere would be good.

317brian · 2024-10-09T15:03:34Z

> #16887 is not added in the release notes. A line item somewhere would be good.

@LakshSingla It doesn't look like it's in the milestone. Should I add it to the milestone too?

clintropolis · 2024-10-11T16:43:47Z

+
+### Projections (experimental)
+
+Druid 31.0.0 includes experimental support for projections in segments. Like materialized views, projections can improve the performance of queries by optimizing the route the query takes when it executes.


ok, i gave this a shot, also included some instruction on how to use the feature since it isn't documented yet

Druid 31.0.0 includes experimental support for a new feature called projections. Projections are grouped pre-aggregates of a segment that are automatically used at query time to optimize execution for any queries which 'fit' the shape of the projection. They reduce both computation and I/O cost by reducing the number of rows that need to be processed. Projections are contained within segments of a datasource and do increase the segment size, but they can share data, such as the value dictionaries of dictionary-encoded columns, with the columns of the base segment.
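
To make the idea of a grouped pre-aggregate concrete, here is a toy sketch in plain Python. It is not how Druid builds or stores projections; the rows, column names, and helper functions are hypothetical and exist only to show why a query that fits the projection's shape has fewer rows to process.

```python
from collections import defaultdict

# Hypothetical raw rows: (hour bucket, channel, page, added)
raw_rows = [
    ("2024-01-01T00", "#en", "Main", 10),
    ("2024-01-01T00", "#en", "Main", 5),
    ("2024-01-01T00", "#fr", "Accueil", 7),
    ("2024-01-01T01", "#en", "Main", 3),
]


def build_projection(rows):
    """Pre-aggregate rows by (hour, channel, page), summing 'added'."""
    totals = defaultdict(int)
    for hour, channel, page, added in rows:
        totals[(hour, channel, page)] += added
    return dict(totals)


def sum_added_by_channel(projection):
    """Answer a coarser query (group by channel) from the pre-aggregated rows."""
    result = defaultdict(int)
    for (_hour, channel, _page), added in projection.items():
        result[channel] += added
    return dict(result)


projection = build_projection(raw_rows)
print(len(raw_rows), "raw rows ->", len(projection), "projection rows")
print(sum_added_by_channel(projection))
```

The query groups on a subset of the projection's grouping columns, so it can be answered from the smaller pre-aggregated set instead of the raw rows.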

As an experimental feature, projections are not well documented yet, but can be defined for streaming ingestion and 'classic' batch ingestion as part of the dataSchema. For example, using the standard wikipedia example:

    "dataSchema": {
      "granularitySpec": { ... },
      "dataSource": ...,
      "timestampSpec": { ... },
      "dimensionsSpec": { ... },
      "projections": [
        {
          "type": "aggregate",
          "name": "channel_page_hourly_distinct_user_added_deleted",
          "groupingColumns": [
            { "type": "long", "name": "__gran" },
            { "type": "string", "name": "channel" },
            { "type": "string", "name": "page" }
          ],
          "virtualColumns": [
            {
              "type": "expression",
              "expression": "timestamp_floor(__time, 'PT1H')",
              "name": "__gran",
              "outputType": "LONG"
            }
          ],
          "aggregators": [
            { "type": "HLLSketchBuild", "name": "distinct_users", "fieldName": "user", "round": true },
            { "type": "longSum", "name": "sum_added", "fieldName": "added" },
            { "type": "longSum", "name": "sum_deleted", "fieldName": "deleted" }
          ]
        },
        ...
      ]
    },
    ...

The `groupingColumns` define the order in which data is sorted in the projection. Instead of defining granularity explicitly, as for the base table, you define it with a virtual column; during ingestion, the processing logic finds the 'finest' granularity virtual column that is a `timestamp_floor` expression and uses it as the `__time` column for the projection. Projections do not need to have a time column defined, in which case they can still match queries that are not grouping on time.
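
The selection rule described above can be sketched roughly as follows. This is an illustration only, not Druid's implementation: the function name, the regular expression, and the small period lookup table are all assumptions, and real ISO 8601 period handling is more involved.

```python
import re

# Assumed lookup of a few ISO 8601 periods to lengths in seconds (illustrative subset).
PERIOD_SECONDS = {"PT1M": 60, "PT1H": 3600, "P1D": 86400}

# Matches expressions of the form: timestamp_floor(__time, '<period>')
FLOOR_PATTERN = re.compile(
    r"timestamp_floor\(\s*__time\s*,\s*'([^']+)'\s*\)", re.IGNORECASE
)


def finest_time_column(virtual_columns):
    """Return the name of the virtual column with the finest
    timestamp_floor period, or None if there is no such column."""
    best_name, best_seconds = None, None
    for column in virtual_columns:
        match = FLOOR_PATTERN.fullmatch(column.get("expression", "").strip())
        if match is None:
            continue  # not a timestamp_floor expression on __time
        seconds = PERIOD_SECONDS.get(match.group(1).upper())
        if seconds is None:
            continue  # period not covered by this sketch
        if best_seconds is None or seconds < best_seconds:
            best_name, best_seconds = column["name"], seconds
    return best_name


example_columns = [
    {"type": "expression", "expression": "timestamp_floor(__time, 'P1D')", "name": "__day"},
    {"type": "expression", "expression": "timestamp_floor(__time, 'PT1H')", "name": "__gran"},
]
print(finest_time_column(example_columns))  # prints __gran
```

When no virtual column qualifies, the sketch returns `None`, which corresponds to a projection without a time column.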

Projections can currently only be defined through classic ingestion, but they can still be used by queries running on MSQ or the new Dart engine. Future development will allow projections to be created as part of MSQ-based ingestion as well.

There are a few new query context flags which have been added to aid in experimentation with projections.

- `useProjection` accepts a specific projection name and instructs the query engine that it must use that projection; the query fails if the projection does not match the query.
- `forceProjections` accepts true or false and instructs the query engine that it must use a projection; the query fails if no matching projection is found.
- `noProjections` accepts true or false and instructs the query engine not to use any projections.
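
As a rough illustration of how these flags might be passed, the sketch below builds a request body for the Druid SQL API, which takes a `query` string and a `context` object. The helper function, the `wikipedia` table name, and the assumption that this particular query fits the example projection are all hypothetical; the body is only constructed here, not sent.

```python
import json


def sql_request_body(sql, use_projection=None, force_projections=None, no_projections=None):
    """Build a SQL API request body, adding only the projection flags that were set."""
    context = {}
    if use_projection is not None:
        context["useProjection"] = use_projection
    if force_projections is not None:
        context["forceProjections"] = force_projections
    if no_projections is not None:
        context["noProjections"] = no_projections
    return {"query": sql, "context": context}


# Hypothetical query against the wikipedia example datasource.
body = sql_request_body(
    'SELECT channel, SUM(added) FROM "wikipedia" GROUP BY channel',
    use_projection="channel_page_hourly_distinct_user_added_deleted",
)
print(json.dumps(body, indent=2))
```

Running the same query once with `noProjections` set to true and once with `useProjection` is one way to compare behavior with and without the projection while experimenting.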

We have a lot of plans to continue to improve this feature in the coming releases, but are excited to get it out there so users can begin experimentation since projections can dramatically improve query performance.

i'm still working on the writeup for a design proposal for this, another option would be to link to that from this since it should contain some of this information

I split this up. Part of it is in the highlight section and the details are in the Querying section. Also instead of including the JSON, I linked to it.

Druid 31.0.0 release notes

59ec903

github-actions Bot added the Area - Documentation label Sep 17, 2024

writer-jill mentioned this pull request Sep 17, 2024

Druid 31.0.0 release notes (closed) #17089


techdocsmith added this to the 31.0.0 milestone Sep 17, 2024

writer-jill added 4 commits September 26, 2024 15:05

Added most recent batch of release notes.

ffeea03

Added apache#17024 to developer notes.

d0f9cb1

Added apache#16874

2fe29ab

Removed apache#16667 - Kashif says not user-facing change

cdbace5

abhishekagarwal87 reviewed Oct 1, 2024


writer-jill added 2 commits October 2, 2024 14:51

Updated after review

b4fb9a5

Updated after review

1d17042

abhishekrb19 reviewed Oct 2, 2024

kfaraz reviewed Oct 3, 2024

317brian and others added 2 commits October 3, 2024 12:46

fix typos

b8652a0

Co-authored-by: Kashif Faraz <kashif.faraz@gmail.com>

add dart blurb from druid blog

e38af70

abhishekrb19 reviewed Oct 3, 2024


techdocsmith and others added 8 commits October 7, 2024 15:52

[Docs] Release notes from Oct 2 batch and from Milestone scrape (#74)

762dda2

docs: More release notes for Druid 31 (#73)

4eb68d9

Co-authored-by: 317brian <53799971+317brian@users.noreply.github.com>

address comments

ed6f734

fixes and some new entries

2e11767

Merge branch '31.0.0' into 31-release-notes

847e84b

update highlights

9bf0b80

Docs: more release notes for Druid 31 (#75)

9cea3f5

fix link

ca129c4

adarshsanjeev reviewed Oct 8, 2024

Akshat-Jain reviewed Oct 8, 2024

317brian and others added 3 commits October 8, 2024 14:19

review comments

2bb0aa3

fixes

ebc5e8c

Druid 31 release notes updates (#76)

4f4f1ca

317brian added 2 commits October 8, 2024 17:05

fixes

1ead3b2

fix typos

6449c97

LakshSingla reviewed Oct 9, 2024


techdocsmith and others added 2 commits October 9, 2024 13:26

[Docs] 31 web console rn (#77)

6d9dfaa

fix typos

34a0876

317brian reviewed Oct 10, 2024


Update docs/release-info/upgrade-notes.md

5fd323a

clintropolis reviewed Oct 11, 2024


317brian added 2 commits October 11, 2024 13:06

Merge branch '31.0.0' into 31-release-notes

f9e0e40

add projections

1207be3

317brian marked this pull request as ready for review October 15, 2024 18:56

317brian added 2 commits October 15, 2024 12:47

add missing period

43c03db

fix typos

cccba1a

317brian reviewed Oct 15, 2024


add 16291 to autocompaction stuff

a98b604

kfaraz reviewed Oct 29, 2024


Fix spelling of accepts

d0adccc

kfaraz merged commit e972706 into apache:31.0.0 Oct 29, 2024

1gtm pushed a commit to appscode-images/druid that referenced this pull request Mar 12, 2025

Druid 31.0.0 release notes (apache#17092)

d280572


