📝 11 新 Issues
✅ 10 关闭
✨ 10 新 PRs
🎉 20 合并
🔥 核心要点
- 🎉 PR #3193 修复 Paimon IOManager 内存泄漏问题,防止 /tmp 磁盘空间耗尽
- 🐛 PR #3148 修复 Coordinator 重启后无事件时指标未更新的问题
- 🚀 PR #3169 实现 Flink 非分区扫描的 Lake 过滤器下推优化
- 📚 PR #3142 添加 Spark 批量联合读取文档
- ✨ PR #3171 Helm Chart 新增 existingSecret 支持 SASL 凭证
💬 Issue/PR 动态
Issue 讨论
Issue #3084 Flink log table consumer threw an out-of-range offset exception
- @loserwang1024: 分析根本原因是远程下载失败时 completeFetch 无限等待,建议添加超时和异常传播机制
Issue #3179 Add documentation for FIP (Flink Improvement Proposal) contribution guide
- @wuchong: 提请关注 FIP 贡献指南文档问题
- @luoyuxia: 说明已有 Confluence FIP 流程文档,询问是否需要复制到官网或添加链接
Issue #3158 [client] Fix race condition in RemoteLogDownloader causing flaky testPrefetchNum
- @ankit-khare-2015: 请求合并更改以解决测试问题
- @wuchong: 分析测试失败根因是 acquire 和 release 之间的竞态条件,建议调整断言范围
Issue #3177 [server] Remove CDC log metrics from PK table write path to avoid dou…
- @binary-signal: 说明 CDC 日志指标是故意分开追踪的,不应移除,移除会破坏复制监控和存储核算
Issue #3192 [server] local-cluster.sh starts successfully but processes fail silently when JAVA_HOME is misconfigured
- @matrixsparse: 表示愿意处理此问题
- @Yohahaha: 感谢并提供进一步协助
Issue #3191 Support configurable time partition format for auto-partitioned tables
PR Review
PR #3116 [spark] Support filter pushdown for log tables
- @Copilot: 指出 IN 转换可能在空列表时抛出异常、JavaConverters 已弃用、流式读取未应用过滤器下推等问题
- @luoyuxia: 指出当日志格式非 Arrow 时可能直接抛出异常的风险
- @fresh-borzoni: 请求审核合并
- @YannByron: 表示同意合并
PR #3132 [server] Optimize RemoteLogFetcher with async prefetch for recovery
- @fresh-borzoni: 指出取消已完成 future 的内存泄漏、shutdownNow 不等待、fetch 被调用两次的竞态、添加 CancellationException 防御等多项问题
- @Kaixuan-Duan: 已更新代码处理所有反馈:添加终止等待、关闭迭代器逻辑、比较 segment id、添加 CancellationException 处理、重试机制等
PR #3158 [client] Fix race condition in RemoteLogDownloader causing flaky testPrefetchNum
- @Copilot: 指出 poll 在 acquire 之前可能违反优先级顺序,建议重新设计获取许可的顺序
- @wuchong: 同意 Copilot 分析,acquire 在 poll 之前是保证消费顺序的关键
- @ankit-khare-2015: 询问是否应该撤销更改继续调查
PR #3171 [helm] Add existingSecret support for SASL credentials
- @fresh-borzoni: 请求评审并说明重新设计了方案以减少重复
- @affo: 提出多项 nit 建议:添加注释块、质疑镜像选择、建议统一配置模式
PR #3142 [spark][doc] Add Spark batch union read
- @luoyuxia: 建议调整 SQL 示例与表定义一致、指出 table.datalake.format 非必需
- @Copilot: 指出 SQL 列名不匹配、文档中后缀概念未解释、功能表描述歧义等问题
PR #3008 [client] Fix corruption after zero-copy lazy parse ByteBuf
- @Copilot: 指出 createDeepElementGetter 和 createDeepFieldGetter 的 API 破坏性变更、缺少 Javadoc 导入等兼容性问题
PR #3196 [spark] Add numRowsRead custom metric and improve Spark UI operator name for FlussScan
- @Yohahaha: 展示了添加 numRowsRead 自定义指标前后的 Spark UI 对比截图
📝 新建 Issue/PR
Issues
- #3198 [docs] Titles are duplicated in the Authentication section @zhuyufeng0809
- #3192 [server] local-cluster.sh starts successfully but processes fail silently when JAVA_HOME is misconfigured @Yohahaha
- #3191 Support configurable time partition format for auto-partitioned tables @luoyuxia
- #3188 [test] Unstable test CoordinatorEventManagerTest.testMetricsUpdatedImmediatelyOnStartup @app/github-actions
- #3180 Add documentation for contributing a blog post @wuchong good first issue, component=website
- #3179 Add documentation for FIP (Flink Improvement Proposal) contribution guide @wuchong good first issue, component=website
- #3176 PK table messagesIn/bytesIn metrics double-counted due to CDC log @swuferhong
- #3173 [server] Auto partition retention cleanup delayed by DAY partition creation jitter @swuferhong
- #3172 [helm] Add existingSecret support for SASL credentials @fresh-borzoni
- #3170 [tiering] Coordinator Should Push Lake Config to Tiering Service via Heartbeat RPC @Yohahaha
- #3168 [flink] Lake filters shoud be pushed down for non-partitioned scans @luoyuxia
Pull Requests
- #3199 [docs] Fix duplicate titles in the Authentication section @zhuyufeng0809
- #3197 [server] Validate Java version >= 11 in config.sh to prevent silent startup failures @matrixsparse
- #3196 [spark] Add numRowsRead custom metric and improve Spark UI operator name for FlussScan @Yohahaha
- #3194 [server] make RemoteLogManifest sorted with startOffset @zuston
- #3190 [tiering] Fix Paimon IOManager leak in MergeTreeWriter @luoyuxia
- #3187 [lake] make compaction strategy min_input_files configurable @wattt3
- #3186 [WIP] feat: system table @gyang94
- #3182 [docs] add contribute a blog post guide (#3180) @tmchow
- #3177 [server] Remove CDC log metrics from PK table write path to avoid dou… @swuferhong
- #3174 [server] Fix auto partition drop delayed by DAY creation jitter @swuferhong
✅ 关闭 Issue/PR
已关闭 Issues
- #3172 [helm] Add existingSecret support for SASL credentials @fresh-borzoni
- #3168 [flink] Lake filters shoud be pushed down for non-partitioned scans @luoyuxia
- #3153 [helm] Add pod scheduling primitives (affinity, nodeSelector, tolerations, topologySpreadConstraints) @fresh-borzoni
- #3147 [server] Coordinator metrics not updated after restart when no events arrive @swuferhong
- #3129 [Test] Unstable test SparkStreamingTest read: log table @wuchong component=connector/spark
- #3122 [spark] Make LakeSplit extend Serializable to simplify Spark serialization @YannByron
- #3072 Csp_check is warning @luoyuxia
- #2848 [Feature] Provide built-in Flink SQL functions for RoaringBitmap construction and cardinality query (rbm32_build, rbm64_build, rbm32_cardinality, rbm64_cardinality) @matrixsparse
- #2652 [docs] Update roadmap page to match the 2026 goals @MehulBatra
- #2599 [docs] Add "Change Data Feed" feature introduction for "Primary Key Table" page @wuchong
已合并 PRs
- #3193 [tiering] Fix Paimon IOManager leak in MergeTreeWriter causing /tmp disk exhaustion @luoyuxia
- #3189 [bp][release-0.9][helm] Add existingSecret support for SASL credentials (#3171) @fresh-borzoni
- #3185 [bp][helm] Add pod annotations, labels, and pod disruption budget support (backport to release-0.9) @morazow
- #3184 [bp][helm] Support injecting environment variables and external secrets (backport to release-0.9) @morazow
- #3183 [helm] Backport helm chart improvements to release-0.9 @morazow
- #3181 [flink] Push down lake filters for non-partitioned scans @luoyuxia
- #3175 [helm] scheduling primitives (#3154) (backport to 0.9) @polyzos
- #3171 [helm] Add existingSecret support for SASL credentials @fresh-borzoni
- #3169 [flink] Push down lake filters for non-partitioned scans @luoyuxia
- #3167 [s3] Propagate path-style-access via delegation token @luoyuxia
- #3164 [spark] Exclude slf4j from spark connector shade @Yohahaha
- #3154 [helm] scheduling primitives @fresh-borzoni
- #3152 [fluss/ai agent] enable ai doc agent on website @MehulBatra
- #3149 [client] Ensure UPDATE_BEFORE and UPDATE_AFTER are never split across poll batches @loserwang1024
- #3148 [server] Fix coordinator metrics not updated when no events arrive @swuferhong
- #3142 [spark][doc] Add Spark batch union read @Yohahaha
- #3141 [spark] Fix flaky SparkStreamingTest by handling interruption when close streaming execution @Yohahaha
- #3123 [spark] Make LakeSplit extend Serializable to simplify Spark serialization @YannByron
- #2696 [docs/website] Update website with roadmap 2026 changes @MehulBatra
- #2695 [doc] Add info for changedatafeed in Log and Pk tables to match the existing industry format @MehulBatra