Twitter data is extracted using ETL data pipelines and Airflow workflows are implemented.
-
Updated
Feb 15, 2023 - Python
Twitter data is extracted using ETL data pipelines and Airflow workflows are implemented.
Starter package to setup Apache Airflow locally.
Apache Airflow Cheatsheet
Process of scheduled data extraction, transform and load is done using Apache Airflow and PySpark
This repo contains the concepts of Apache Airflow and the practical implemetation I'll be doing while learning.
Automate Apache Spark in Hadoop with Airflow in Cloud
A simple dag for triggering the Cloud Data Fusion Pipeline using Apache Airflow.
An example Apache Airflow DAG-definition source repository, to be used with the Airflow DAG Aggregator.
Setup for Apache Airflow with Docker.
Udacity project within the Data Engineer Nanodegree
Project in Course of Udacity's Data Engineering Nano-Degree
Celery and Kubernetes operators are used in order to manage data engineering pipelines of stocks and cryptocurrencies prices
The ETL Pipeline using a way autoscaling
A simple DataOps for wine dataset on Docker
Playing around with Airflow
This repository contains the projects I completed in the Udacity Data Engineering Nanodegree.
Add a description, image, and links to the airflow topic page so that developers can more easily learn about it.
To associate your repository with the airflow topic, visit your repo's landing page and select "manage topics."