Apache Beam

Apache Beam is a unified model for defining both batch and streaming data-parallel processing pipelines. It provides a general approach to expressing embarrassingly parallel data processing pipelines and supports end users, SDK writers, and runner writers.


First released: June 15, 2016
Developed by: Apache Software Foundation
Open source: Yes

Interesting facts

A Beam pipeline is executed by one of the supported distributed processing back-ends, called runners, which include Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow.

