Conversation

Contributor

@nvartolomei commented Aug 21, 2025

ref https://redpandadata.atlassian.net/browse/INC-923

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v25.2.x
  • v25.1.x
  • v24.3.x

Release Notes

Bug Fixes

  • datalake: handle corrupted records by writing them to DLQ table.

@nvartolomei requested review from andrwng and bharathv and removed the request for andrwng (August 21, 2025 15:05)
@nvartolomei requested a review from andrwng (August 21, 2025 15:05)
@nvartolomei force-pushed the nv/datalake-dlq-corrupted-records branch from 6c68128 to 0d6313c (August 21, 2025 17:50)
@@ -609,4 +639,42 @@ record_multiplexer::handle_invalid_record(
co_return std::nullopt;
}
}

ss::future<result<void, writer_error>>
record_multiplexer::handle_corrupted_record(
Contributor
IMO this would be less surprising for callers / more accurate if it were called handle_corrupted_batch(), given we're treating the rest of the batch as corrupted, not just one record

Comment on lines +147 to +149
_log.warn,
"Error reading record from batch: {} at index {}. Err: {}",
batch.header(),
Contributor

nit: probably warrants an "error" log given the rarity of this situation.

translation_probe::invalid_record_cause::corrupted_record,
o,
std::nullopt,
data_copy.share(0, data_copy.size_bytes()),
Contributor

I'm wondering if we could keep the data payload only for the first offset.

In the particular case where this crashed, the batch was > 100 MiB, seems wasteful to copy it over so many times.

3 participants