
Conversation

@zuston (Member) commented Oct 22, 2025

What changes were proposed in this pull request?

This PR introduces a timeout mechanism when fetching the overlapping decompression data.

Why are the changes needed?

Without this PR, the blocking wait carries the risk of hanging tasks forever when the RPC hangs.

Does this PR introduce any user-facing change?

A new option `rss.client.read.overlappingDecompressionFetchSecondsThreshold` is introduced. With its default value of -1, this mechanism is disabled.
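
For illustration, a minimal sketch of the intended semantics (not the actual Uniffle code; the class and method names are made up here): the blocking wait stays unbounded when the threshold is -1, and otherwise fails fast after the configured number of seconds.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

class OverlappingFetchTimeoutSketch {
  static byte[] waitForDecompressedData(
      CompletableFuture<byte[]> pending, long fetchSecondsThreshold) throws Exception {
    if (fetchSecondsThreshold < 0) {
      // -1 (the default) keeps the original behavior: wait indefinitely.
      return pending.get();
    }
    try {
      // Bounded wait: fail the task quickly instead of hanging forever when the RPC hangs.
      return pending.get(fetchSecondsThreshold, TimeUnit.SECONDS);
    } catch (TimeoutException e) {
      throw new RuntimeException(
          "Timed out after " + fetchSecondsThreshold
              + "s waiting for overlapping decompression data", e);
    }
  }
}
```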

How was this patch tested?

Internal job tests

github-actions bot commented Oct 22, 2025

Test Results

 3 150 files  ±0   3 150 suites  ±0   6h 50m 42s ⏱️ +30s
 1 210 tests ±0   1 209 ✅ ±0   1 💤 ±0  0 ❌ ±0 
15 314 runs  ±0  15 299 ✅ ±0  15 💤 ±0  0 ❌ ±0 

Results for commit aed7490. ± Comparison against base commit d9815c0.

♻️ This comment has been updated with latest results.

@jerqi (Contributor) commented Oct 22, 2025

Is it ok if the data is big?

@zuston (Member, Author) commented Oct 23, 2025

> Is it ok if the data is big?

Normally this won't happen: the block size is controlled by the writer, which flushes the block once it reaches the threshold.

@jerqi (Contributor) commented Oct 23, 2025

> Is it ok if the data is big?
>
> Normally this won't happen: the block size is controlled by the writer, which flushes the block once it reaches the threshold.

One record may be large. For example, I once saw a 100MB record.

@zuston (Member, Author) commented Oct 23, 2025

> Is it ok if the data is big?
>
> Normally this won't happen: the block size is controlled by the writer, which flushes the block once it reaches the threshold.
>
> One record may be large. For example, I once saw a 100MB record.

Maybe the default threshold value could be larger. I hope this PR makes tasks fail as fast as possible rather than hang, which would be terrible.

@jerqi (Contributor) commented Oct 23, 2025

> Is it ok if the data is big?
>
> Normally this won't happen: the block size is controlled by the writer, which flushes the block once it reaches the threshold.
>
> One record may be large. For example, I once saw a 100MB record.
>
> Maybe the default threshold value could be larger. I hope this PR makes tasks fail as fast as possible rather than hang, which would be terrible.

Just giving you some input: should the timeout consider the data length? If you think it's fine to keep the current state, I'm OK too, since this is a corner case.
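
Purely illustrative: one way the timeout could account for the data length, as suggested above. The helper name and the throughput constant are assumptions, not part of this PR.

```java
class SizeAwareTimeoutSketch {
  static long timeoutSeconds(long baseThresholdSeconds, long blockSizeBytes) {
    // Assumed worst-case acceptable throughput; purely a placeholder value.
    long assumedMinBytesPerSecond = 16L * 1024 * 1024;
    long sizeBasedSeconds =
        (blockSizeBytes + assumedMinBytesPerSecond - 1) / assumedMinBytesPerSecond;
    // Never go below the configured threshold; larger blocks get proportionally more time.
    return Math.max(baseThresholdSeconds, sizeBasedSeconds);
  }
}
```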

jerqi previously approved these changes Oct 29, 2025
@zuston zuston merged commit 17d2b25 into apache:master Nov 4, 2025
41 checks passed
@zuston zuston deleted the readtimeout branch November 4, 2025 09:11