Skip to content

Conversation

@asolimando
Copy link
Member

@asolimando asolimando commented May 23, 2022

What changes were proposed in this pull request?

We add a vectorized implementation for the ds_kll_sketch UDAF

Why are the changes needed?

When this UDAF is used either alone or at the side of other vectorizable functions, it will benefit from a performance speed-up.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

mvn test -Dtest=TestMiniLlapLocalCliDriver -Dqfile="compute_kll_sketch.q" -Dtest.output.overwrite -pl itests/qtest -Pitests

and

mvn test -Dtest=TestMiniLlapLocalCliDriver -Dqfile="sketches_rewrite_rank_partition_by.q,sketches_rewrite_rank.q,sketches_rewrite_percentile_disc.q,sketches_rewrite_ntile_partition_by.q,sketches_rewrite_ntile.q,sketches_rewrite_cume_dist_partition_by.q,sketches_rewrite_cume_dist.q,sketches_materialized_view_rank.q,sketches_materialized_view_percentile_disc.q,sketches_materialized_view_ntile.q,sketches_materialized_view_cume_dist.q" -Dtest.output.overwrite -pl itests/qtest -Pitests

@github-actions github-actions bot requested a review from kgyrtkirk May 23, 2022 17:17
@asolimando asolimando changed the title Master hive 26243 ds kll sketch vectorized HIVE-26243 ds kll sketch vectorized May 24, 2022
@asolimando asolimando force-pushed the master-HIVE-26243-ds_kll_sketch_vectorized branch from f9622cd to a610bf7 Compare May 24, 2022 11:53
@asolimando asolimando force-pushed the master-HIVE-26243-ds_kll_sketch_vectorized branch from a610bf7 to 266c500 Compare May 25, 2022 12:20
@asolimando
Copy link
Member Author

Before going into the single discussions, the general answer to all the above comments boils down to "I am trying to keep consistency with what was done here for vectorizing HyperLogLog function": https://github.com/apache/hive/pull/1824/files

I sense that you don't like how that PR was designed, but since they are very close in spirit, and that their code is used side by side, I thought it was important to keep them consistent.

If we need to rework the current PR, they won't match anymore, unless we rework the HLL design and implementation too, and this has its own share of cons...

Assuming we go for the refactoring, most of the comments are too sketchy to give appropriate guidance over an alternative design/implementation, I will need to ask you to elaborate more on them.

For instance, you seem to be suggesting to remove all helper classes/methods etc. Since it does not seem feasible to inline all the code now sitting in the helper methods/classes directly in the vectorized implementation, I guess you want to place it someplace else, but I can't really decide based on your comment.

For the couple of currently unused methods, I will need them in a PR depending on this one: https://issues.apache.org/jira/browse/HIVE-26221: I can remove them now and re-introduce them later, if preferable. Once again they mimic HLL methods (both naming and usage, since HLL and KLL methods will be used side by side in most places, it helps reading what's happening, see LongColumnStatsAggregator.java#L104-L111, for instance).

@kgyrtkirk
Copy link
Member

PR-1824 has nothing to do with datasketches; I don't know how you followed it's conventions but you might end up in trouble...because DS also has a HLL implementation...wouldn't that conflict with the existing one?

note: PR-1824 named the file VectorUDAFComputeBitVector.txt and internally named the method compute_bit_vector_hll ; I think the class name should have contained the Hll keyword

I think that the file name VectorUDAFComputeKLL.txt is not connected at all to the ds_kll_sketch function its about to vectorize...and as such its a bit confusing....

The current implementation doesn't really look forward: I think we have 20 sketch function from datasketches already exposed as inside Hive which could be vectorized; I think they are behind the same api cover...so just vectorizing the KLL one without any sight forward and taking "ideas" from the old hll codepath doesn't seem the best idea to me...

grep ^ds_ ql/src/test/results/clientpositive/llap/show_functions.q.out|grep _sketch$

no need to do everything in 1 patch - but this is pretty much just copy-pasting the existing hll txtfile substituted to kll here and there...so we should do that 20 times?

For instance, you seem to be suggesting to remove all helper classes/methods etc
I don't think those changes neccessary in the metastore for a vectorization of this function?

HIVE-26221 is something which have changes - but has no real end-user accessible value - and as such I don't think its ready.

@asolimando asolimando force-pushed the master-HIVE-26243-ds_kll_sketch_vectorized branch from a2404e3 to e2f02ea Compare November 19, 2022 10:13
@asolimando
Copy link
Member Author

Rebasing on master

@sonarqubecloud
Copy link

Kudos, SonarCloud Quality Gate passed!    Quality Gate passed

Bug A 0 Bugs
Vulnerability A 0 Vulnerabilities
Security Hotspot A 0 Security Hotspots
Code Smell A 3 Code Smells

No Coverage information No Coverage information
No Duplication information No Duplication information

@asolimando
Copy link
Member Author

@deniskuzZ tests are green, ready to be merged when you have a moment, thanks!

@deniskuzZ deniskuzZ merged commit ad19ec3 into apache:master Nov 21, 2022
@asolimando asolimando deleted the master-HIVE-26243-ds_kll_sketch_vectorized branch November 21, 2022 09:07
dengzhhu653 pushed a commit to dengzhhu653/hive that referenced this pull request Dec 15, 2022
… (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317
DongWei-4 pushed a commit to DongWei-4/hive that referenced this pull request Dec 29, 2022
… (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317

(cherry picked from commit ad19ec3)
InvisibleProgrammer pushed a commit to InvisibleProgrammer/hive that referenced this pull request Jul 13, 2023
Comment out classes to work on to be able to build the module

Refator TestCopyUtils.java

Before refactor: TestReplDumpTask

Refactor TestReplDumpTask.java

Before refactor: TestAtlasLoadTask

Refactor TestAtlasLoadTask.java

Refator TestRangerDumpTask.java

Before refactor: TestCompactionHeartbeatService

Refactor TestCompactionHeartbeatService.java

Before refactor: TestRetryable

Refactor TestRetryable.java

Before refactor: TestRangerLoadTask

Refactor: RangerLoadTask

Before refactor: TestAtlasDumpTask

Refactor: AtlasDumpTask

Before refactor: TestPrimaryToReplicaResourceFunction

Refactor TestPrimaryToReplicaResourceFunction

Before refactor: TestExportService

Refactor: TestExportService

HIVE-26522: Added test for HIVE-22033 regarding delegation token renewal (apache#3585)

HIVE-26676: Count distinct in subquery returning wrong results (Steve Carlin, reviewed by Alessandro Solimando, Aman Sinha, Krisztian Kasa)

HIVE-26736: Authorization failure for nested Views having WITH clause. (apache#3760). (Ayush Saxena, reviewed by Denys Kuzmenko)

HIVE-26628: Iceberg table is created when running explain ctas command (Krisztian Kasa, reviewed by Denys Kuzmenko)

HIVE-26734: Iceberg: Add an option to allow positional delete files without actual row data. (apache#3758). (Ayush Saxena, reviewed by Adam Szita, Denys Kuzmenko)

HIVE-26524: Use Calcite to remove sections of a query plan known never produces rows - ADDENDUM (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26740: HS2 makes direct connections to HMS backend DB due to Compaction/StatsUpdater (apache#3765) (Adam Szita, reviewed by Zhihua Deng)

HIVE-26631: Remove unused Thrift config parameters login.timeout and exponential.backoff.slot.length (xiuzhu9527 reviewed by Stamatis Zampetakis)

Closes apache#3672

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis) ADDENDUM

HIVE-26745: HPL unable to handle Decimal or null values in hplsql mode (apache#3769) (Adam Szita, reviewed by Attila Magyar and Denys Kuzmenko)

HIVE-26722: HiveFilterSetOpTransposeRule incorrectly prunes UNION ALL operands.  (apache#3748). (Alessandro Solimando, reviewed by Ayush Saxena, Simhadri Govindappa)

HIVE-26746: Request tracking: change to X-Request-ID header (apache#3770) (Laszlo Bodor reviewed by Zhihua Deng)

HIVE-26624: Set repl.background.enable on target after failover completion (Vinit Patni, reviewed by László Pintér, Teddy Choi)

Co-authored-by: vpatni <vpatni@cloudera.com>

HIVE-26712: HCatMapReduceTest writes test files in project base directory instead of build directory. (apache#3738) (Chris Nauroth reviewed by Ayush Saxena)

HIVE-26726: Tinyint column with windowing fn crashes at runtime (Steve Carlin, reviewed by Aman Sinha, Krisztian Kasa)

HIVE-26680: Make CMV use Direct Insert Semantics (Sourabh Badhya, reviewed by Denys Kuzmenko, Laszlo Vegh)

Closes apache#3715

HIVE-26243: Add vectorized implementation of the 'ds_kll_sketch' UDAF (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317

HIVE-26761: Add result sorting to complex_alias.q (apache#3783) (Balazs Cseh reviewed by Laszlo Bodor)

HIVE-26759: Update SHOW COMPACTIONS query to support Postgres HMS (Akshat Mathur, reviewed by Denys Kuzmenko, Zsolt Miskolczi)

Closes 3782

HIVE-26765: Hive Ranger URL policy for insert overwrite directory denies access when fully qualified paths are passed (apache#3790) (Simhadri Govindappa, reviewed by Adam Szita)

Small refactors

Fix bug
InvisibleProgrammer pushed a commit to InvisibleProgrammer/hive that referenced this pull request Jul 13, 2023
Comment out classes to work on to be able to build the module

Refator TestCopyUtils.java

Before refactor: TestReplDumpTask

Refactor TestReplDumpTask.java

Before refactor: TestAtlasLoadTask

Refactor TestAtlasLoadTask.java

Refator TestRangerDumpTask.java

Before refactor: TestCompactionHeartbeatService

Refactor TestCompactionHeartbeatService.java

Before refactor: TestRetryable

Refactor TestRetryable.java

Before refactor: TestRangerLoadTask

Refactor: RangerLoadTask

Before refactor: TestAtlasDumpTask

Refactor: AtlasDumpTask

Before refactor: TestPrimaryToReplicaResourceFunction

Refactor TestPrimaryToReplicaResourceFunction

Before refactor: TestExportService

Refactor: TestExportService

HIVE-26522: Added test for HIVE-22033 regarding delegation token renewal (apache#3585)

HIVE-26676: Count distinct in subquery returning wrong results (Steve Carlin, reviewed by Alessandro Solimando, Aman Sinha, Krisztian Kasa)

HIVE-26736: Authorization failure for nested Views having WITH clause. (apache#3760). (Ayush Saxena, reviewed by Denys Kuzmenko)

HIVE-26628: Iceberg table is created when running explain ctas command (Krisztian Kasa, reviewed by Denys Kuzmenko)

HIVE-26734: Iceberg: Add an option to allow positional delete files without actual row data. (apache#3758). (Ayush Saxena, reviewed by Adam Szita, Denys Kuzmenko)

HIVE-26524: Use Calcite to remove sections of a query plan known never produces rows - ADDENDUM (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26740: HS2 makes direct connections to HMS backend DB due to Compaction/StatsUpdater (apache#3765) (Adam Szita, reviewed by Zhihua Deng)

HIVE-26631: Remove unused Thrift config parameters login.timeout and exponential.backoff.slot.length (xiuzhu9527 reviewed by Stamatis Zampetakis)

Closes apache#3672

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis) ADDENDUM

HIVE-26745: HPL unable to handle Decimal or null values in hplsql mode (apache#3769) (Adam Szita, reviewed by Attila Magyar and Denys Kuzmenko)

HIVE-26722: HiveFilterSetOpTransposeRule incorrectly prunes UNION ALL operands.  (apache#3748). (Alessandro Solimando, reviewed by Ayush Saxena, Simhadri Govindappa)

HIVE-26746: Request tracking: change to X-Request-ID header (apache#3770) (Laszlo Bodor reviewed by Zhihua Deng)

HIVE-26624: Set repl.background.enable on target after failover completion (Vinit Patni, reviewed by László Pintér, Teddy Choi)

Co-authored-by: vpatni <vpatni@cloudera.com>

HIVE-26712: HCatMapReduceTest writes test files in project base directory instead of build directory. (apache#3738) (Chris Nauroth reviewed by Ayush Saxena)

HIVE-26726: Tinyint column with windowing fn crashes at runtime (Steve Carlin, reviewed by Aman Sinha, Krisztian Kasa)

HIVE-26680: Make CMV use Direct Insert Semantics (Sourabh Badhya, reviewed by Denys Kuzmenko, Laszlo Vegh)

Closes apache#3715

HIVE-26243: Add vectorized implementation of the 'ds_kll_sketch' UDAF (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317

HIVE-26761: Add result sorting to complex_alias.q (apache#3783) (Balazs Cseh reviewed by Laszlo Bodor)

HIVE-26759: Update SHOW COMPACTIONS query to support Postgres HMS (Akshat Mathur, reviewed by Denys Kuzmenko, Zsolt Miskolczi)

Closes 3782

HIVE-26765: Hive Ranger URL policy for insert overwrite directory denies access when fully qualified paths are passed (apache#3790) (Simhadri Govindappa, reviewed by Adam Szita)

Small refactors

Fix bug
InvisibleProgrammer pushed a commit to InvisibleProgrammer/hive that referenced this pull request Jul 14, 2023
Comment out classes to work on to be able to build the module

Refator TestCopyUtils.java

Before refactor: TestReplDumpTask

Refactor TestReplDumpTask.java

Before refactor: TestAtlasLoadTask

Refactor TestAtlasLoadTask.java

Refator TestRangerDumpTask.java

Before refactor: TestCompactionHeartbeatService

Refactor TestCompactionHeartbeatService.java

Before refactor: TestRetryable

Refactor TestRetryable.java

Before refactor: TestRangerLoadTask

Refactor: RangerLoadTask

Before refactor: TestAtlasDumpTask

Refactor: AtlasDumpTask

Before refactor: TestPrimaryToReplicaResourceFunction

Refactor TestPrimaryToReplicaResourceFunction

Before refactor: TestExportService

Refactor: TestExportService

HIVE-26522: Added test for HIVE-22033 regarding delegation token renewal (apache#3585)

HIVE-26676: Count distinct in subquery returning wrong results (Steve Carlin, reviewed by Alessandro Solimando, Aman Sinha, Krisztian Kasa)

HIVE-26736: Authorization failure for nested Views having WITH clause. (apache#3760). (Ayush Saxena, reviewed by Denys Kuzmenko)

HIVE-26628: Iceberg table is created when running explain ctas command (Krisztian Kasa, reviewed by Denys Kuzmenko)

HIVE-26734: Iceberg: Add an option to allow positional delete files without actual row data. (apache#3758). (Ayush Saxena, reviewed by Adam Szita, Denys Kuzmenko)

HIVE-26524: Use Calcite to remove sections of a query plan known never produces rows - ADDENDUM (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26740: HS2 makes direct connections to HMS backend DB due to Compaction/StatsUpdater (apache#3765) (Adam Szita, reviewed by Zhihua Deng)

HIVE-26631: Remove unused Thrift config parameters login.timeout and exponential.backoff.slot.length (xiuzhu9527 reviewed by Stamatis Zampetakis)

Closes apache#3672

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis) ADDENDUM

HIVE-26745: HPL unable to handle Decimal or null values in hplsql mode (apache#3769) (Adam Szita, reviewed by Attila Magyar and Denys Kuzmenko)

HIVE-26722: HiveFilterSetOpTransposeRule incorrectly prunes UNION ALL operands.  (apache#3748). (Alessandro Solimando, reviewed by Ayush Saxena, Simhadri Govindappa)

HIVE-26746: Request tracking: change to X-Request-ID header (apache#3770) (Laszlo Bodor reviewed by Zhihua Deng)

HIVE-26624: Set repl.background.enable on target after failover completion (Vinit Patni, reviewed by László Pintér, Teddy Choi)

Co-authored-by: vpatni <vpatni@cloudera.com>

HIVE-26712: HCatMapReduceTest writes test files in project base directory instead of build directory. (apache#3738) (Chris Nauroth reviewed by Ayush Saxena)

HIVE-26726: Tinyint column with windowing fn crashes at runtime (Steve Carlin, reviewed by Aman Sinha, Krisztian Kasa)

HIVE-26680: Make CMV use Direct Insert Semantics (Sourabh Badhya, reviewed by Denys Kuzmenko, Laszlo Vegh)

Closes apache#3715

HIVE-26243: Add vectorized implementation of the 'ds_kll_sketch' UDAF (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317

HIVE-26761: Add result sorting to complex_alias.q (apache#3783) (Balazs Cseh reviewed by Laszlo Bodor)

HIVE-26759: Update SHOW COMPACTIONS query to support Postgres HMS (Akshat Mathur, reviewed by Denys Kuzmenko, Zsolt Miskolczi)

Closes 3782

HIVE-26765: Hive Ranger URL policy for insert overwrite directory denies access when fully qualified paths are passed (apache#3790) (Simhadri Govindappa, reviewed by Adam Szita)

Small refactors

Fix bug
InvisibleProgrammer pushed a commit to InvisibleProgrammer/hive that referenced this pull request Jul 19, 2023
Comment out classes to work on to be able to build the module

Refator TestCopyUtils.java

Before refactor: TestReplDumpTask

Refactor TestReplDumpTask.java

Before refactor: TestAtlasLoadTask

Refactor TestAtlasLoadTask.java

Refator TestRangerDumpTask.java

Before refactor: TestCompactionHeartbeatService

Refactor TestCompactionHeartbeatService.java

Before refactor: TestRetryable

Refactor TestRetryable.java

Before refactor: TestRangerLoadTask

Refactor: RangerLoadTask

Before refactor: TestAtlasDumpTask

Refactor: AtlasDumpTask

Before refactor: TestPrimaryToReplicaResourceFunction

Refactor TestPrimaryToReplicaResourceFunction

Before refactor: TestExportService

Refactor: TestExportService

HIVE-26522: Added test for HIVE-22033 regarding delegation token renewal (apache#3585)

HIVE-26676: Count distinct in subquery returning wrong results (Steve Carlin, reviewed by Alessandro Solimando, Aman Sinha, Krisztian Kasa)

HIVE-26736: Authorization failure for nested Views having WITH clause. (apache#3760). (Ayush Saxena, reviewed by Denys Kuzmenko)

HIVE-26628: Iceberg table is created when running explain ctas command (Krisztian Kasa, reviewed by Denys Kuzmenko)

HIVE-26734: Iceberg: Add an option to allow positional delete files without actual row data. (apache#3758). (Ayush Saxena, reviewed by Adam Szita, Denys Kuzmenko)

HIVE-26524: Use Calcite to remove sections of a query plan known never produces rows - ADDENDUM (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26740: HS2 makes direct connections to HMS backend DB due to Compaction/StatsUpdater (apache#3765) (Adam Szita, reviewed by Zhihua Deng)

HIVE-26631: Remove unused Thrift config parameters login.timeout and exponential.backoff.slot.length (xiuzhu9527 reviewed by Stamatis Zampetakis)

Closes apache#3672

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis) ADDENDUM

HIVE-26745: HPL unable to handle Decimal or null values in hplsql mode (apache#3769) (Adam Szita, reviewed by Attila Magyar and Denys Kuzmenko)

HIVE-26722: HiveFilterSetOpTransposeRule incorrectly prunes UNION ALL operands.  (apache#3748). (Alessandro Solimando, reviewed by Ayush Saxena, Simhadri Govindappa)

HIVE-26746: Request tracking: change to X-Request-ID header (apache#3770) (Laszlo Bodor reviewed by Zhihua Deng)

HIVE-26624: Set repl.background.enable on target after failover completion (Vinit Patni, reviewed by László Pintér, Teddy Choi)

Co-authored-by: vpatni <vpatni@cloudera.com>

HIVE-26712: HCatMapReduceTest writes test files in project base directory instead of build directory. (apache#3738) (Chris Nauroth reviewed by Ayush Saxena)

HIVE-26726: Tinyint column with windowing fn crashes at runtime (Steve Carlin, reviewed by Aman Sinha, Krisztian Kasa)

HIVE-26680: Make CMV use Direct Insert Semantics (Sourabh Badhya, reviewed by Denys Kuzmenko, Laszlo Vegh)

Closes apache#3715

HIVE-26243: Add vectorized implementation of the 'ds_kll_sketch' UDAF (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317

HIVE-26761: Add result sorting to complex_alias.q (apache#3783) (Balazs Cseh reviewed by Laszlo Bodor)

HIVE-26759: Update SHOW COMPACTIONS query to support Postgres HMS (Akshat Mathur, reviewed by Denys Kuzmenko, Zsolt Miskolczi)

Closes 3782

HIVE-26765: Hive Ranger URL policy for insert overwrite directory denies access when fully qualified paths are passed (apache#3790) (Simhadri Govindappa, reviewed by Adam Szita)

Small refactors

Fix bug
yeahyung pushed a commit to yeahyung/hive that referenced this pull request Jul 20, 2023
… (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317
InvisibleProgrammer pushed a commit to InvisibleProgrammer/hive that referenced this pull request Aug 7, 2023
Comment out classes to work on to be able to build the module

Refator TestCopyUtils.java

Before refactor: TestReplDumpTask

Refactor TestReplDumpTask.java

Before refactor: TestAtlasLoadTask

Refactor TestAtlasLoadTask.java

Refator TestRangerDumpTask.java

Before refactor: TestCompactionHeartbeatService

Refactor TestCompactionHeartbeatService.java

Before refactor: TestRetryable

Refactor TestRetryable.java

Before refactor: TestRangerLoadTask

Refactor: RangerLoadTask

Before refactor: TestAtlasDumpTask

Refactor: AtlasDumpTask

Before refactor: TestPrimaryToReplicaResourceFunction

Refactor TestPrimaryToReplicaResourceFunction

Before refactor: TestExportService

Refactor: TestExportService

HIVE-26522: Added test for HIVE-22033 regarding delegation token renewal (apache#3585)

HIVE-26676: Count distinct in subquery returning wrong results (Steve Carlin, reviewed by Alessandro Solimando, Aman Sinha, Krisztian Kasa)

HIVE-26736: Authorization failure for nested Views having WITH clause. (apache#3760). (Ayush Saxena, reviewed by Denys Kuzmenko)

HIVE-26628: Iceberg table is created when running explain ctas command (Krisztian Kasa, reviewed by Denys Kuzmenko)

HIVE-26734: Iceberg: Add an option to allow positional delete files without actual row data. (apache#3758). (Ayush Saxena, reviewed by Adam Szita, Denys Kuzmenko)

HIVE-26524: Use Calcite to remove sections of a query plan known never produces rows - ADDENDUM (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26740: HS2 makes direct connections to HMS backend DB due to Compaction/StatsUpdater (apache#3765) (Adam Szita, reviewed by Zhihua Deng)

HIVE-26631: Remove unused Thrift config parameters login.timeout and exponential.backoff.slot.length (xiuzhu9527 reviewed by Stamatis Zampetakis)

Closes apache#3672

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis) ADDENDUM

HIVE-26745: HPL unable to handle Decimal or null values in hplsql mode (apache#3769) (Adam Szita, reviewed by Attila Magyar and Denys Kuzmenko)

HIVE-26722: HiveFilterSetOpTransposeRule incorrectly prunes UNION ALL operands.  (apache#3748). (Alessandro Solimando, reviewed by Ayush Saxena, Simhadri Govindappa)

HIVE-26746: Request tracking: change to X-Request-ID header (apache#3770) (Laszlo Bodor reviewed by Zhihua Deng)

HIVE-26624: Set repl.background.enable on target after failover completion (Vinit Patni, reviewed by László Pintér, Teddy Choi)

Co-authored-by: vpatni <vpatni@cloudera.com>

HIVE-26712: HCatMapReduceTest writes test files in project base directory instead of build directory. (apache#3738) (Chris Nauroth reviewed by Ayush Saxena)

HIVE-26726: Tinyint column with windowing fn crashes at runtime (Steve Carlin, reviewed by Aman Sinha, Krisztian Kasa)

HIVE-26680: Make CMV use Direct Insert Semantics (Sourabh Badhya, reviewed by Denys Kuzmenko, Laszlo Vegh)

Closes apache#3715

HIVE-26243: Add vectorized implementation of the 'ds_kll_sketch' UDAF (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317

HIVE-26761: Add result sorting to complex_alias.q (apache#3783) (Balazs Cseh reviewed by Laszlo Bodor)

HIVE-26759: Update SHOW COMPACTIONS query to support Postgres HMS (Akshat Mathur, reviewed by Denys Kuzmenko, Zsolt Miskolczi)

Closes 3782

HIVE-26765: Hive Ranger URL policy for insert overwrite directory denies access when fully qualified paths are passed (apache#3790) (Simhadri Govindappa, reviewed by Adam Szita)

Small refactors

Fix bug
InvisibleProgrammer pushed a commit to InvisibleProgrammer/hive that referenced this pull request Sep 11, 2023
Comment out classes to work on to be able to build the module

Refator TestCopyUtils.java

Before refactor: TestReplDumpTask

Refactor TestReplDumpTask.java

Before refactor: TestAtlasLoadTask

Refactor TestAtlasLoadTask.java

Refator TestRangerDumpTask.java

Before refactor: TestCompactionHeartbeatService

Refactor TestCompactionHeartbeatService.java

Before refactor: TestRetryable

Refactor TestRetryable.java

Before refactor: TestRangerLoadTask

Refactor: RangerLoadTask

Before refactor: TestAtlasDumpTask

Refactor: AtlasDumpTask

Before refactor: TestPrimaryToReplicaResourceFunction

Refactor TestPrimaryToReplicaResourceFunction

Before refactor: TestExportService

Refactor: TestExportService

HIVE-26522: Added test for HIVE-22033 regarding delegation token renewal (apache#3585)

HIVE-26676: Count distinct in subquery returning wrong results (Steve Carlin, reviewed by Alessandro Solimando, Aman Sinha, Krisztian Kasa)

HIVE-26736: Authorization failure for nested Views having WITH clause. (apache#3760). (Ayush Saxena, reviewed by Denys Kuzmenko)

HIVE-26628: Iceberg table is created when running explain ctas command (Krisztian Kasa, reviewed by Denys Kuzmenko)

HIVE-26734: Iceberg: Add an option to allow positional delete files without actual row data. (apache#3758). (Ayush Saxena, reviewed by Adam Szita, Denys Kuzmenko)

HIVE-26524: Use Calcite to remove sections of a query plan known never produces rows - ADDENDUM (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26740: HS2 makes direct connections to HMS backend DB due to Compaction/StatsUpdater (apache#3765) (Adam Szita, reviewed by Zhihua Deng)

HIVE-26631: Remove unused Thrift config parameters login.timeout and exponential.backoff.slot.length (xiuzhu9527 reviewed by Stamatis Zampetakis)

Closes apache#3672

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis) ADDENDUM

HIVE-26745: HPL unable to handle Decimal or null values in hplsql mode (apache#3769) (Adam Szita, reviewed by Attila Magyar and Denys Kuzmenko)

HIVE-26722: HiveFilterSetOpTransposeRule incorrectly prunes UNION ALL operands.  (apache#3748). (Alessandro Solimando, reviewed by Ayush Saxena, Simhadri Govindappa)

HIVE-26746: Request tracking: change to X-Request-ID header (apache#3770) (Laszlo Bodor reviewed by Zhihua Deng)

HIVE-26624: Set repl.background.enable on target after failover completion (Vinit Patni, reviewed by László Pintér, Teddy Choi)

Co-authored-by: vpatni <vpatni@cloudera.com>

HIVE-26712: HCatMapReduceTest writes test files in project base directory instead of build directory. (apache#3738) (Chris Nauroth reviewed by Ayush Saxena)

HIVE-26726: Tinyint column with windowing fn crashes at runtime (Steve Carlin, reviewed by Aman Sinha, Krisztian Kasa)

HIVE-26680: Make CMV use Direct Insert Semantics (Sourabh Badhya, reviewed by Denys Kuzmenko, Laszlo Vegh)

Closes apache#3715

HIVE-26243: Add vectorized implementation of the 'ds_kll_sketch' UDAF (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317

HIVE-26761: Add result sorting to complex_alias.q (apache#3783) (Balazs Cseh reviewed by Laszlo Bodor)

HIVE-26759: Update SHOW COMPACTIONS query to support Postgres HMS (Akshat Mathur, reviewed by Denys Kuzmenko, Zsolt Miskolczi)

Closes 3782

HIVE-26765: Hive Ranger URL policy for insert overwrite directory denies access when fully qualified paths are passed (apache#3790) (Simhadri Govindappa, reviewed by Adam Szita)

Small refactors

Fix bug
InvisibleProgrammer pushed a commit to InvisibleProgrammer/hive that referenced this pull request Sep 11, 2023
Comment out classes to work on to be able to build the module

Refator TestCopyUtils.java

Before refactor: TestReplDumpTask

Refactor TestReplDumpTask.java

Before refactor: TestAtlasLoadTask

Refactor TestAtlasLoadTask.java

Refator TestRangerDumpTask.java

Before refactor: TestCompactionHeartbeatService

Refactor TestCompactionHeartbeatService.java

Before refactor: TestRetryable

Refactor TestRetryable.java

Before refactor: TestRangerLoadTask

Refactor: RangerLoadTask

Before refactor: TestAtlasDumpTask

Refactor: AtlasDumpTask

Before refactor: TestPrimaryToReplicaResourceFunction

Refactor TestPrimaryToReplicaResourceFunction

Before refactor: TestExportService

Refactor: TestExportService

HIVE-26522: Added test for HIVE-22033 regarding delegation token renewal (apache#3585)

HIVE-26676: Count distinct in subquery returning wrong results (Steve Carlin, reviewed by Alessandro Solimando, Aman Sinha, Krisztian Kasa)

HIVE-26736: Authorization failure for nested Views having WITH clause. (apache#3760). (Ayush Saxena, reviewed by Denys Kuzmenko)

HIVE-26628: Iceberg table is created when running explain ctas command (Krisztian Kasa, reviewed by Denys Kuzmenko)

HIVE-26734: Iceberg: Add an option to allow positional delete files without actual row data. (apache#3758). (Ayush Saxena, reviewed by Adam Szita, Denys Kuzmenko)

HIVE-26524: Use Calcite to remove sections of a query plan known never produces rows - ADDENDUM (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26740: HS2 makes direct connections to HMS backend DB due to Compaction/StatsUpdater (apache#3765) (Adam Szita, reviewed by Zhihua Deng)

HIVE-26631: Remove unused Thrift config parameters login.timeout and exponential.backoff.slot.length (xiuzhu9527 reviewed by Stamatis Zampetakis)

Closes apache#3672

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis) ADDENDUM

HIVE-26745: HPL unable to handle Decimal or null values in hplsql mode (apache#3769) (Adam Szita, reviewed by Attila Magyar and Denys Kuzmenko)

HIVE-26722: HiveFilterSetOpTransposeRule incorrectly prunes UNION ALL operands.  (apache#3748). (Alessandro Solimando, reviewed by Ayush Saxena, Simhadri Govindappa)

HIVE-26746: Request tracking: change to X-Request-ID header (apache#3770) (Laszlo Bodor reviewed by Zhihua Deng)

HIVE-26624: Set repl.background.enable on target after failover completion (Vinit Patni, reviewed by László Pintér, Teddy Choi)

Co-authored-by: vpatni <vpatni@cloudera.com>

HIVE-26712: HCatMapReduceTest writes test files in project base directory instead of build directory. (apache#3738) (Chris Nauroth reviewed by Ayush Saxena)

HIVE-26726: Tinyint column with windowing fn crashes at runtime (Steve Carlin, reviewed by Aman Sinha, Krisztian Kasa)

HIVE-26680: Make CMV use Direct Insert Semantics (Sourabh Badhya, reviewed by Denys Kuzmenko, Laszlo Vegh)

Closes apache#3715

HIVE-26243: Add vectorized implementation of the 'ds_kll_sketch' UDAF (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317

HIVE-26761: Add result sorting to complex_alias.q (apache#3783) (Balazs Cseh reviewed by Laszlo Bodor)

HIVE-26759: Update SHOW COMPACTIONS query to support Postgres HMS (Akshat Mathur, reviewed by Denys Kuzmenko, Zsolt Miskolczi)

Closes 3782

HIVE-26765: Hive Ranger URL policy for insert overwrite directory denies access when fully qualified paths are passed (apache#3790) (Simhadri Govindappa, reviewed by Adam Szita)

Small refactors

Fix bug

Remove uneccessary constructor

Upgrading mockito in hive-exec
InvisibleProgrammer pushed a commit to InvisibleProgrammer/hive that referenced this pull request Sep 16, 2023
# This is the 1st commit message:

Start to refactor

Comment out classes to work on to be able to build the module

Refator TestCopyUtils.java

Before refactor: TestReplDumpTask

Refactor TestReplDumpTask.java

Before refactor: TestAtlasLoadTask

Refactor TestAtlasLoadTask.java

Refator TestRangerDumpTask.java

Before refactor: TestCompactionHeartbeatService

Refactor TestCompactionHeartbeatService.java

Before refactor: TestRetryable

Refactor TestRetryable.java

Before refactor: TestRangerLoadTask

Refactor: RangerLoadTask

Before refactor: TestAtlasDumpTask

Refactor: AtlasDumpTask

Before refactor: TestPrimaryToReplicaResourceFunction

Refactor TestPrimaryToReplicaResourceFunction

Before refactor: TestExportService

Refactor: TestExportService

HIVE-26522: Added test for HIVE-22033 regarding delegation token renewal (apache#3585)

HIVE-26676: Count distinct in subquery returning wrong results (Steve Carlin, reviewed by Alessandro Solimando, Aman Sinha, Krisztian Kasa)

HIVE-26736: Authorization failure for nested Views having WITH clause. (apache#3760). (Ayush Saxena, reviewed by Denys Kuzmenko)

HIVE-26628: Iceberg table is created when running explain ctas command (Krisztian Kasa, reviewed by Denys Kuzmenko)

HIVE-26734: Iceberg: Add an option to allow positional delete files without actual row data. (apache#3758). (Ayush Saxena, reviewed by Adam Szita, Denys Kuzmenko)

HIVE-26524: Use Calcite to remove sections of a query plan known never produces rows - ADDENDUM (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26740: HS2 makes direct connections to HMS backend DB due to Compaction/StatsUpdater (apache#3765) (Adam Szita, reviewed by Zhihua Deng)

HIVE-26631: Remove unused Thrift config parameters login.timeout and exponential.backoff.slot.length (xiuzhu9527 reviewed by Stamatis Zampetakis)

Closes apache#3672

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis)

HIVE-26747: Remove implementor from HiveRelNode (Krisztian Kasa, reviewed by Stamatis Zampetakis) ADDENDUM

HIVE-26745: HPL unable to handle Decimal or null values in hplsql mode (apache#3769) (Adam Szita, reviewed by Attila Magyar and Denys Kuzmenko)

HIVE-26722: HiveFilterSetOpTransposeRule incorrectly prunes UNION ALL operands.  (apache#3748). (Alessandro Solimando, reviewed by Ayush Saxena, Simhadri Govindappa)

HIVE-26746: Request tracking: change to X-Request-ID header (apache#3770) (Laszlo Bodor reviewed by Zhihua Deng)

HIVE-26624: Set repl.background.enable on target after failover completion (Vinit Patni, reviewed by László Pintér, Teddy Choi)

Co-authored-by: vpatni <vpatni@cloudera.com>

HIVE-26712: HCatMapReduceTest writes test files in project base directory instead of build directory. (apache#3738) (Chris Nauroth reviewed by Ayush Saxena)

HIVE-26726: Tinyint column with windowing fn crashes at runtime (Steve Carlin, reviewed by Aman Sinha, Krisztian Kasa)

HIVE-26680: Make CMV use Direct Insert Semantics (Sourabh Badhya, reviewed by Denys Kuzmenko, Laszlo Vegh)

Closes apache#3715

HIVE-26243: Add vectorized implementation of the 'ds_kll_sketch' UDAF (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317

HIVE-26761: Add result sorting to complex_alias.q (apache#3783) (Balazs Cseh reviewed by Laszlo Bodor)

HIVE-26759: Update SHOW COMPACTIONS query to support Postgres HMS (Akshat Mathur, reviewed by Denys Kuzmenko, Zsolt Miskolczi)

Closes 3782

HIVE-26765: Hive Ranger URL policy for insert overwrite directory denies access when fully qualified paths are passed (apache#3790) (Simhadri Govindappa, reviewed by Adam Szita)

Small refactors

Fix bug

# This is the commit message apache#2:

Remove uneccessary constructor
tarak271 pushed a commit to tarak271/hive-1 that referenced this pull request Dec 19, 2023
… (Alessandro Solimando, reviewed by Denys Kuzmenko, Zoltan Haindrich)

Closes apache#3317
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants