Filtering and sampling streams - CS 591 K1: Data Stream Processing and Analytics Spring 2020numeric time series. What summary would let us compute the statistical variance of this series? 3 var = ∑ (xi − μ)2 N ??? Vasiliki Kalavri | Boston University 2020 A simple and efficient synopsis Suppose the sum of all the values • the sum of the squares of the values • the number of observations var = ∑ (xi − μ)2 N ??? Vasiliki Kalavri | Boston University 2020 A simple and efficient synopsis Suppose the squares of the values • the number of observations • μ = sum / count • var = (sum of squares / count) - μ2 Then var = ∑ (xi − μ)2 N ??? Vasiliki Kalavri | Boston University 2020 A simple and0 码力 | 74 页 | 1.06 MB | 1 年前3
Notions of time and progress - CS 591 K1: Data Stream Processing and Analytics Spring 2020PeriodicAssigner extends AssignerWithPeriodicWatermarks[Reading] { val bound: Long = 60 * 1000 // 1 min in ms var maxTs: Long = Long.MinValue // the max observed timestamp override def getCurrentWatermark: Watermark0 码力 | 22 页 | 2.22 MB | 1 年前3
State management - CS 591 K1: Data Stream Processing and Analytics Spring 2020extends RichFlatMapFunction[Reading, (String, Double, Double)] { // the state handle object private var lastTempState: ValueState[Double] = _ override def open(parameters: Configuration): Unit = { // create0 码力 | 24 页 | 914.13 KB | 1 年前3
PyFlink 1.15 DocumentationPython-3.7.9 && \ ln -s /usr/local/bin/python3 /usr/local/bin/python && \ apt-get clean && \ rm -rf /var/lib/apt/lists/* # install PyFlink RUN pip3 install apache-flink==1.15.2 Execute PyFlink jobs in application0 码力 | 36 页 | 266.77 KB | 1 年前3
PyFlink 1.16 DocumentationPython-3.7.9 && \ ln -s /usr/local/bin/python3 /usr/local/bin/python && \ apt-get clean && \ rm -rf /var/lib/apt/lists/* # install PyFlink RUN pip3 install apache-flink==1.15.2 Execute PyFlink jobs in application0 码力 | 36 页 | 266.80 KB | 1 年前3
共 5 条
- 1













