GCP Dataflow Flex Template Pipeline ??? Part 1 (Overview)

In this project, we will be building a data pipeline using the GCP Dataflow service. We will be creating a Flex template in which you can pass API endpoint response, extract keys in the response, and then export it to BigQuery.

Repository with Full Code: https://github.com/amandeepsaluja/gcp-dataflow-pipeline/tree/master/01-flex-template

Steps in the Project:

  1. Create the Pipeline in Python
  2. Create the Docker Image
  3. Setup Terraform configuration files
  4. Build & deploy Docker Image of the Flex template on Artifact Registry (Terraform + GitHub Actions)
  5. Create and Deploy the Dataflow job (GitHub Actions)

Click Here

Tags: Dataflow GCP