Scope
- Use cases
- Capture JSON parse errors, schema mismatches, or business validation failures.
- Persist failure payloads with reasons and timestamps for later fixes and replays.
- Non-use cases
- Complex semantic validation pipelines.
Common steps
Build context
- Identify critical streams (e.g.,
OrderCompleted,PaymentInfoAdded). - Define failure categories and DLQ storage (Kafka topic or Iceberg table).
Implementation notes
- Flink SQL connectors may support
ignore-parse-errors; combine with a side output for failures using custom functions, or implement parsing in a Flink job. - Store original payload and error details for replay.
RESINK.AI recommendations
Example
Variations
- DLQ to Kafka topic
Troubleshooting
High DLQ volume spikes
High DLQ volume spikes
Alert on DLQ rates; throttle or temporarily buffer in object storage during incidents.
Cannot replay DLQ events cleanly
Cannot replay DLQ events cleanly
Store the original payload and a canonical timestamp; build a replay job that reuses the same parsing logic.

