[SQL] SPARK-1427 Fix toString for SchemaRDD NativeCommands.#343

Closed
marmbrus wants to merge 1 commit into apache:master from marmbrus:toStringFix

Conversation

@marmbrus
Contributor

@marmbrus marmbrus commented Apr 7, 2014

No description provided.

@rxin
Contributor

rxin commented Apr 7, 2014

lgtm

@AmplabJenkins

Merged build triggered.

@AmplabJenkins

Merged build started.

@AmplabJenkins

Merged build finished. All automated tests passed.

@AmplabJenkins

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13838/

@rxin
Contributor

rxin commented Apr 7, 2014

merged.

@asfgit asfgit closed this in b5bae84 Apr 7, 2014
@marmbrus marmbrus deleted the toStringFix branch April 7, 2014 23:16
andrewor14 pushed a commit to andrewor14/spark that referenced this pull request Apr 7, 2014
Fix test breaking downstream builds

This wasn't detected in the pull-request-builder because it manually sets SPARK_HOME. I'm going to change that (it shouldn't do this) to make it like the other builds.
pdeyhim pushed a commit to pdeyhim/spark-1 that referenced this pull request Jun 25, 2014
Author: Michael Armbrust <michael@databricks.com>

Closes apache#343 from marmbrus/toStringFix and squashes the following commits:

37198fe [Michael Armbrust] Fix toString for SchemaRDD NativeCommands.
mccheah pushed a commit to mccheah/spark that referenced this pull request Oct 3, 2018
arjunshroff pushed a commit to arjunshroff/spark that referenced this pull request Nov 24, 2020
LuciferYang added a commit that referenced this pull request Mar 17, 2026
### What changes were proposed in this pull request?
This PR aims to upgrade `arrow-java` from 18.3.0 to 19.0.0.

It also fixes a buffer leak in `SparkResult.processResponses()` that **only manifests after this upgrade** and has **no actual impact under Arrow 18.3.0**. The issue is that when a deserialized Arrow batch contains 0 rows, the `ArrowMessage` objects were silently dropped without calling `close()`, and were not stored in `resultMap` (so `SparkResultCloseable.close()` would not release them either). Under Arrow 18.3.0 this was completely harmless — empty batches produced a 0-byte IPC body, which goes through `BaseAllocator.buffer(0)` → `getEmpty()` (a singleton backed by `EmptyReferenceManager` whose `retain()`/`release()` are no-ops and not tracked by the allocator), so no off-heap memory was ever allocated or leaked. However, Arrow 19.0.0 includes [GH-343](apache/arrow-java#343), which correctly serializes offset buffers for empty vectors per the Arrow spec, making the IPC body non-zero. This causes real tracked off-heap buffers to be allocated, and the missing `close()` becomes a real memory leak detectable by `allocator.close()`. Therefore this fix is included as a necessary companion change for the 19.0.0 upgrade.
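The leak pattern described above can be illustrated with a minimal, dependency-free sketch. Every name below (`TrackingAllocator`, `Buffer`, `Batch`, `LeakSketch`, and the 4-byte body size) is hypothetical and stands in for the real Arrow and Spark classes; it is not the actual `SparkResult.processResponses()` code. It only models the idea that a dropped empty batch is harmless while its body is 0 bytes and becomes a tracked leak once the body is non-zero.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for a tracking allocator: counts bytes not yet released. */
final class TrackingAllocator {
    private long outstanding = 0;

    Buffer allocate(long size) {
        outstanding += size;
        return new Buffer(this, size);
    }

    void release(long size) {
        outstanding -= size;
    }

    long outstandingBytes() {
        return outstanding;
    }
}

/** Hypothetical buffer that returns its bytes to the allocator exactly once. */
final class Buffer implements AutoCloseable {
    private final TrackingAllocator owner;
    private final long size;
    private boolean closed = false;

    Buffer(TrackingAllocator owner, long size) {
        this.owner = owner;
        this.size = size;
    }

    @Override
    public void close() {
        if (!closed) {
            closed = true;
            owner.release(size);
        }
    }
}

/** Hypothetical deserialized batch: a row count plus one body buffer. */
final class Batch implements AutoCloseable {
    final int rowCount;
    private final Buffer body;

    Batch(TrackingAllocator allocator, int rowCount, long bodyBytes) {
        this.rowCount = rowCount;
        this.body = allocator.allocate(bodyBytes);
    }

    @Override
    public void close() {
        body.close();
    }
}

final class LeakSketch {
    /** Leaky variant: empty batches are dropped without being closed. */
    static List<Batch> keepNonEmptyLeaky(List<Batch> incoming) {
        List<Batch> kept = new ArrayList<>();
        for (Batch b : incoming) {
            if (b.rowCount > 0) {
                kept.add(b);
            }
        }
        return kept;
    }

    /** Fixed variant: empty batches are closed before being dropped. */
    static List<Batch> keepNonEmptyFixed(List<Batch> incoming) {
        List<Batch> kept = new ArrayList<>();
        for (Batch b : incoming) {
            if (b.rowCount > 0) {
                kept.add(b);
            } else {
                b.close();
            }
        }
        return kept;
    }

    /** Bytes still outstanding after the kept batches are closed. */
    static long leakedBytes(boolean fixed, long emptyBodyBytes) {
        TrackingAllocator allocator = new TrackingAllocator();
        List<Batch> incoming = Arrays.asList(
                new Batch(allocator, 3, 64),
                new Batch(allocator, 0, emptyBodyBytes));
        List<Batch> kept = fixed ? keepNonEmptyFixed(incoming) : keepNonEmptyLeaky(incoming);
        for (Batch b : kept) {
            b.close(); // models closing the stored results
        }
        return allocator.outstandingBytes();
    }

    public static void main(String[] args) {
        System.out.println(leakedBytes(false, 0)); // 0: a 0-byte body hides the missing close()
        System.out.println(leakedBytes(false, 4)); // 4: a non-zero body exposes the leak
        System.out.println(leakedBytes(true, 4));  // 0: closing the empty batch releases it
    }
}
```

In this toy model the leaky path only misbehaves once the empty batch carries a non-zero body, which mirrors why the description treats the `close()` fix as a companion to the upgrade rather than a standalone bug fix.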

### Why are the changes needed?
The full release notes are as follows:
- https://github.com/apache/arrow-java/releases/tag/v19.0.0

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
- Pass GitHub Actions

### Was this patch authored or co-authored using generative AI tooling?
No

Closes #54820 from LuciferYang/SPARK-56000.

Lead-authored-by: yangjie01 <yangjie01@baidu.com>
Co-authored-by: YangJie <yangjie01@baidu.com>
Signed-off-by: yangjie01 <yangjie01@baidu.com>