Apache Flink is an open source stream processing framework that originated from a research project at TU Berlin in 2009. It is undergoing incubation at the Apache Software Foundation and allows for fast and reliable processing of big data. Flink provides high-level APIs for the Java and Scala programming languages to define and execute dataflow programs on top of distributed datasets.