Skip to content
View adityachaudhary99's full-sized avatar

Highlights

  • Pro

Block or report adityachaudhary99

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
adityachaudhary99/README.md

Hello, I'm Aditya Chaudhary waving hand gif

Data Engineer | Backend Developer | DevOps Enthusiast

LinkedIn Twitter

πŸ“Š Data Engineering Journey

I build robust data pipelines and analytics infrastructure that transform raw data into valuable insights. My expertise spans the entire data engineering lifecycle - from ingestion to transformation to serving.

πŸ’» Tech Ecosystem

Core Languages Python, Golang, SQL
Python Ecosystem FastAPI, Streamlit, Pydantic, Polars, BeautifulSoup, Selenium, dlt
Big Data & Processing Apache Spark, Kafka, Hadoop
Cloud & Infrastructure GCP, AWS, Docker, Kubernetes, Terraform
Data Platforms Snowflake, Databricks, DBT
Orchestration Airflow, Mage
Databases PostgreSQL, MongoDB, Cassandra, Elasticsearch, SQLite, MySQL
Visualization Tableau, Looker Studio

πŸš€ Featured Projects

Data File Viewer – VS/Cursor Code Extension

VS Code extension to view and explore binary data files directly in the editor. Supports 11 formats including pkl, h5, parquet, feather, joblib, npy, npz, msgpack, arrow, avro, nc, and mat.

Technologies: TypeScript, Python, VS Code API, Virtual Environments

Data File Viewer repo

AWS Terraform Multi-Environment Template

Production-ready Infrastructure as Code template for AWS with support for dev, staging, and prod environments. Features modular architecture and remote state management.

Technologies: Terraform, AWS, IaC, Multi-Environment Setup

AWS Terraform Template repo

Parallelization Engine

Distributed parallelization engine using Docker, Celery, and RabbitMQ for scalable task execution. Features dynamic worker scaling, fault tolerance, and throughput optimization.

Technologies: Python, Docker, Celery, RabbitMQ, Distributed Systems

Parallelization-Engine repo

Motor Vehicle Collision Analysis Pipeline

End-to-end ETL pipeline that processes traffic accident data to identify patterns and risk factors.

Technologies: Python, Apache Airflow, Spark, Data Visualization

Motor Vehicles Collision Analysis repo

Real Estate Analysis Pipeline

Data pipeline that aggregates property listing data to generate market insights and trends.

Technologies: Python, DBT, Snowflake, Data Modeling

Real Estate Analysis repo

LinkedIn Network Analyzer

Tool that extracts and processes professional network data to uncover industry trends and connection opportunities.

Technologies: Python, Selenium, BeautifulSoup, MongoDB

LinkedIn Scraper repo

🏸 Life Beyond Code

When I'm not building data pipelines, you might find me on the badminton court, swimming laps, or exploring new destinations.

Pinned Loading

  1. data-file-viewer data-file-viewer Public

    View and explore binary data files in VS Code/Cursor - 11 formats supported (pkl, h5, parquet, feather, joblib, npy, npz, msgpack, arrow, avro, nc, mat)

    TypeScript 2

  2. aws-terraform-multi-env-template aws-terraform-multi-env-template Public

    Production-ready Terraform template for AWS multi-environment deployments with ECS Fargate, RDS, ALB, and CI/CD. Includes dev, staging, and prod environments.

    HCL

  3. dataengineering-motor-vehicles-collision-analysis-project dataengineering-motor-vehicles-collision-analysis-project Public

    Jupyter Notebook 1

  4. dataengineering-real-estate-analysis-project dataengineering-real-estate-analysis-project Public

    Python