build: Add basic CI test pipelines by sunchao · Pull Request #18 · apache/datafusion-comet

sunchao · 2024-02-13T22:03:20Z

This adds some basic CI test pipelines for the project. Basically run tests in the Rust and Java/Scala side within the repo.

Closes #7

sunchao · 2024-02-13T22:04:19Z

Need to find a way to test this. Also this only run tests in Linux for Spark 3.4 atm, and we need more combinations.

sunchao · 2024-02-14T05:39:39Z

@viirya @andygrove this is ready now. Tested here.

sunchao · 2024-02-14T05:40:12Z

    /// See [`object_panic_exception`] for a test which involves generating a panic and verifying
    /// that the resulting stack trace includes the offending call.
    #[test]
+    #[ignore]


the golden file for this test is not added yet, so ignore for now

sunchao · 2024-02-14T05:40:44Z


  tpcdsQueries.foreach { q =>
-    test(s"check simplified (tpcds-v1.4/$q)") {
+    ignore(s"check simplified (tpcds-v1.4/$q)") {


depends on #10

viirya · 2024-02-14T06:12:13Z

+        run: |
+          cd core
+          # This is required to run some JNI related tests on the Rust side
+          LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$JAVA_HOME/lib:$JAVA_HOME/lib/server cargo test


RUST_BACKTRACE=1?

viirya · 2024-02-14T06:13:47Z

+
+      - name: Run tests
+        run: |
+          SPARK_HOME=`pwd` ./mvnw clean install


Hmm, we use ./mvnw verify in Makefile, do we need to do that?

Don't we need BOSON_CONF_DIR?

install covers verify: https://maven.apache.org/guides/introduction/introduction-to-the-lifecycle.html#a-build-lifecycle-is-made-up-of-phases

COMET_CONF_DIR is optional and by default, Rust test outputs are directed to stdout.

viirya · 2024-02-14T06:17:35Z

There are a log message like this:

Downloaded from central: https://repo.maven.apache.org/maven2/org/codehaus/plexus/plexus-utils/3.0.22/plexus-utils-3.0.22.jar (245 kB at 2.3 MB/s)
Progress (1): 1.0/2.3 MB
Progress (1): 1.1/2.3 MB
Progress (1): 1.1/2.3 MB
...

It is really distracting for looking at the logs. Can we silent it?

sunchao · 2024-02-14T06:22:44Z

It is really distracting for looking at the logs. Can we silent it?

Hmm perhaps we can cache the Maven dependencies so these will only show up in the first time. Let me check

sunchao · 2024-02-14T15:40:31Z

Thanks! merged. Let me check if the Maven cache did work. I'll create followups if it doesn't.

Fixes all 14 previously-deferred review findings: apache#4 Case-sensitivity in DV column detection: isDeltaDvFilterPattern and findAndStripDeltaScanBelow now use equalsIgnoreCase for the __delta_internal_is_row_deleted column name match. apache#8 S3 key documentation: added comment in JNI documenting the Hadoop-style key names that storageOptions carries and how extract_storage_config maps them. apache#10 Proto comment inaccuracy: updated reserved field number comments to describe purpose rather than referencing (now-stale) phase numbers. Added field numbering strategy note on DeltaScanCommon. apache#11 Module quarantine docs: updated delta/mod.rs doc comment to note that create_object_store returns Arc<dyn ObjectStore> from object_store_kernel 0.12, and that it never escapes the module. Updated public API listing to match current exports. apache#12 Optimizer rule double-init: added synchronized double-checked locking on the CometDeltaDvConfigRule to prevent concurrent threads from racing on the config set. apache#14 Incomplete partition type support: castPartitionString now throws IllegalArgumentException for unsupported types (STRUCT, ARRAY, MAP, etc.) instead of silently converting to UTF8String. apache#6 DV materialization clarity: added comment explaining why .unwrap_or_default() is safe (get_row_indexes returns Ok(None) only if has_vector() lied, which kernel guarantees doesn't happen; Err propagates via ?). apache#17 Consistent JNI null handling: extracted read_string_array helper for reading Java String[] into Vec<String>, consolidating the null-check + iteration pattern. apache#18-19 Proto field ordering: added numbering strategy comment to DeltaScanCommon and DeltaScanTask messages. apache#20 Memory note: added comment about potential driver OOM on extremely large tables (millions of files) with suggestion for future streaming/chunked processing. Tests: succeeded 35, failed 0, canceled 0, ignored 0, pending 0 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

sunchao force-pushed the add-ci branch 7 times, most recently from 9a6cd49 to 1d82548 Compare February 14, 2024 05:11

sunchao marked this pull request as ready for review February 14, 2024 05:38

sunchao commented Feb 14, 2024

View reviewed changes

initial commit

a4cb7bf

sunchao force-pushed the add-ci branch from 1d82548 to a4cb7bf Compare February 14, 2024 05:41

viirya reviewed Feb 14, 2024

View reviewed changes

viirya approved these changes Feb 14, 2024

View reviewed changes

more

0e1cffb

sunchao force-pushed the add-ci branch from 525d951 to 0e1cffb Compare February 14, 2024 07:28

andygrove approved these changes Feb 14, 2024

View reviewed changes

sunchao merged commit 311ef6b into apache:main Feb 14, 2024

sunchao deleted the add-ci branch February 14, 2024 15:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

build: Add basic CI test pipelines#18

build: Add basic CI test pipelines#18
sunchao merged 2 commits intoapache:mainfrom
sunchao:add-ci

sunchao commented Feb 13, 2024 •

edited

Loading

Uh oh!

sunchao commented Feb 13, 2024

Uh oh!

sunchao commented Feb 14, 2024

Uh oh!

sunchao Feb 14, 2024

Uh oh!

sunchao Feb 14, 2024

Uh oh!

viirya Feb 14, 2024

Uh oh!

sunchao Feb 14, 2024

Uh oh!

viirya Feb 14, 2024

Uh oh!

viirya Feb 14, 2024 •

edited

Loading

Uh oh!

sunchao Feb 14, 2024 •

edited

Loading

Uh oh!

viirya commented Feb 14, 2024

Uh oh!

sunchao commented Feb 14, 2024

Uh oh!

sunchao commented Feb 14, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sunchao commented Feb 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sunchao commented Feb 13, 2024

Uh oh!

sunchao commented Feb 14, 2024

Uh oh!

sunchao Feb 14, 2024

Choose a reason for hiding this comment

Uh oh!

sunchao Feb 14, 2024

Choose a reason for hiding this comment

Uh oh!

viirya Feb 14, 2024

Choose a reason for hiding this comment

Uh oh!

sunchao Feb 14, 2024

Choose a reason for hiding this comment

Uh oh!

viirya Feb 14, 2024

Choose a reason for hiding this comment

Uh oh!

viirya Feb 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sunchao Feb 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

viirya commented Feb 14, 2024

Uh oh!

sunchao commented Feb 14, 2024

Uh oh!

sunchao commented Feb 14, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sunchao commented Feb 13, 2024 •

edited

Loading

viirya Feb 14, 2024 •

edited

Loading

sunchao Feb 14, 2024 •

edited

Loading