This website uses cookies
We use cookies to continuously improve your experience on our site. More info.
A unified model for defining both batch and streaming data-parallel processing pipelines. Provides a general approach to expressing embarrassingly parallel data processing pipelines and supports end users, SDK writers, and runner writers.
First released | June 15, 2016 |
Developed by | Apache Software Foundation |
Open-source | Yes |
The Beam pipeline is executed by one of the distributed processing back-ends, which include Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow.