🎯
Focusing
Data Engineer:
Python | SQL |
Spark | Kafka | AWS | Docker | Airflow |
Data Modeling, Data Pipelines, Data Warehousing, Data Lakes, Analytics
-
A datalake house project build with Spark, Airflow, Docker, Minio and much more !
Python UpdatedJul 7, 2025 -
gcp-music-event-streaming Public
Data Pipeline built using Airflow, dbt, Kakfa, Spark, GCP, Docker, Terraform
Python UpdatedOct 17, 2024 -
streaming-kafka-pyspark-avro Public
Streaming Data Pipeline for NYC taxi Rides build using kafka and Pyspark locally running on docker containers
Python UpdatedSep 26, 2024 -
de-zoomcamp Public
My Learning from DataTalksClub data-engineering-zoomcamp
Python UpdatedJul 15, 2024 -
Data Pipeline for unstructured data on AWS with spark
Python UpdatedJul 12, 2024