Skip to content

[doc] add FAQ for Stream Load Broken pipe when syncing large tables via Flink CDC#3597

Closed
JNSimba wants to merge 2 commits intoapache:masterfrom
JNSimba:flink-faq-oracle-cdc-parallel
Closed

[doc] add FAQ for Stream Load Broken pipe when syncing large tables via Flink CDC#3597
JNSimba wants to merge 2 commits intoapache:masterfrom
JNSimba:flink-faq-oracle-cdc-parallel

Conversation

@JNSimba
Copy link
Copy Markdown
Member

@JNSimba JNSimba commented Apr 29, 2026

Summary

Add a new FAQ entry to the Flink Doris Connector docs (English/Chinese, dev/4.x) for the I/O exception (java.net.SocketException) ... Broken pipe error that users may encounter when syncing large tables (e.g. Oracle) via Flink CDC.

The FAQ explains the root cause (single Stream Load data volume exceeding the BE-side limit) and lists three mitigations:

  • Increase BE streaming_load_max_mb.
  • Enable Connector batch mode (sink.enable.batch-mode=true).
  • Increase Oracle CDC parallelism with --oracle-conf scan.incremental.snapshot.enabled=true (experimental).

Also references the upstream Flink CDC FAQ for further reading.

Files changed

  • docs/ecosystem/flink-doris-connector/flink-doris-connector.md (English dev)
  • i18n/zh-CN/.../current/ecosystem/flink-doris-connector/flink-doris-connector.md (Chinese dev)
  • versioned_docs/version-4.x/ecosystem/flink-doris-connector/flink-doris-connector.md (English 4.x)
  • i18n/zh-CN/.../version-4.x/ecosystem/flink-doris-connector/flink-doris-connector.md (Chinese 4.x)

JNSimba and others added 2 commits April 23, 2026 10:40
Rename the streaming job doc concepts to eliminate the misleading
"Single Table Sync / Full Database Sync" dichotomy. The two are
fundamentally different mechanisms (Job + TVF vs. native
FROM ... TO DATABASE DDL), not a "table count" distinction.
Database-level Sync can also sync a single table via include_tables.

- Single Table Sync -> Table-level Sync (单表同步 -> 表级同步)
- Full Database Sync -> Database-level Sync (整库同步 -> 库级同步)
- Rename files via git mv (continuous-load-*-{single,multi} -> {table,database})
- Update sidebars.ts and version-4.x-sidebars.json
- Update cdc-stream.md cross-references
- Add "How to Choose" capability-comparison section to overview pages
- Add "Job Lifecycle" state-machine section to overview pages

Applied to dev (current) and 4.x versions, both Chinese and English.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@JNSimba
Copy link
Copy Markdown
Member Author

JNSimba commented Apr 29, 2026

Replaced by a clean PR based on master.

@JNSimba JNSimba closed this Apr 29, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant