17 Matching Annotations
  1. Nov 2023
  2. nightlies.apache.org nightlies.apache.org
    1. Note that the write*() methods on DataStream are mainly intended for debugging purposes. They are not participating in Flink’s checkpointing, this means these functions usually have at-least-once semantics. The data flushing to the target system depends on the implementation of the OutputFormat. This means that not all elements send to the OutputFormat are immediately showing up in the target system. Also, in failure cases, those records might be lost. For reliable, exactly-once delivery of a stream into a file system, use the FileSink. Also, custom implementations through the .addSink(...) method can participate in Flink’s checkpointing for exactly-once semantics.

      生产禁用write*(),改用addSink()

  3. Dec 2022
  4. Jun 2022
  5. nightlies.apache.org nightlies.apache.org
    1. 每个 TaskManager 有一个 slot,这意味着每个 task 组都在单独的 JVM 中运行(例如,可以在单独的容器中启动)。具有多个 slot 意味着更多 subtask 共享同一 JVM。同一 JVM 中的 task 共享 TCP 连接(通过多路复用)和心跳信息。它们还可以共享数据集和数据结构,从而减少了每个 task 的开销。
    2. 算子链接成 task 是个有用的优化:它减少线程间切换、缓冲的开销,并且减少延迟的同时增加整体吞吐量。链行为是可以配置的;请参考链文档以获取详细信息。
  6. May 2022
  7. nightlies.apache.org nightlies.apache.org
    1. Those elements that do get joined will have as their timestamp the largest timestamp that still lies in the respective window. For example a window with [5, 10) as its boundaries would result in the joined elements having 9 as their timestamp.

      那些确实被加入的元素将具有仍然位于相应窗口中的最大时间戳作为它们的时间戳。例如,以 [5, 10) 作为其边界的窗口将导致连接的元素以 9 作为其时间戳。

  8. Feb 2021