sreyas-lankala/README.md

Hi, I'm Sreyas Lankala 👋

Data Quality & Governance Engineer | Building trusted data platforms at scale

LinkedIn · Email · GitHub


About Me

I'm a Data Quality & Governance Engineer with 5+ years of professional experience building the infrastructure that makes data teams trust their data.

  • 🏢 Previously: Amazon (Data Operations) · Hexaware Technologies (Data Engineer – DQ & Governance) · Mphasis (Data Specialist)
  • 🎓 MS Computer Science · Concordia University of Wisconsin · May 2026
  • 📍 Madison, WI · Open to relocation across the USA
  • 🛂 Authorized to work in the USA on F-1 OPT starting June 2026 (STEM OPT eligible – 3-year authorization)
  • 💡 Obsessed with one question: How do you make data trustworthy at scale?

🛠️ Tech Stack

Data Quality & Governance

Great Expectations · dbt · Collibra · Apache Atlas · Microsoft Purview

Data Engineering & Pipelines

Snowflake · Apache Airflow · Apache Spark

Languages & Tools

Python · SQL · PySpark · Power BI

Cloud

AWS · Azure · GCP


🚀 Featured Projects

Airflow · Azure Data Lake · Microsoft Purview · dbt · Python · SQL

Enterprise-grade data governance platform processing 2M+ synthetic clinical records (Synthea). Automated validation frameworks, full metadata catalog, data lineage tracking, and observability dashboards built to meet real-world healthcare compliance standards.

Key highlights: Data quality rules engine · Metadata cataloging · Schema drift detection · Lineage mapping · SLA monitoring
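For readers unfamiliar with schema drift detection, here is a minimal sketch of the general idea: compare an expected column-to-type mapping against what actually arrived. It is an illustration only, not code from this project; the names `SchemaDrift` and `detect_schema_drift` and the sample columns are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass
class SchemaDrift:
    """Differences between an expected schema and an observed one."""
    added: Dict[str, str]                 # columns present only in the observed schema
    removed: Dict[str, str]               # columns present only in the expected schema
    changed: Dict[str, Tuple[str, str]]   # column -> (expected_type, observed_type)

    @property
    def has_drift(self) -> bool:
        return bool(self.added or self.removed or self.changed)


def detect_schema_drift(expected: Dict[str, str], observed: Dict[str, str]) -> SchemaDrift:
    """Compare two column -> type mappings and report what differs."""
    added = {col: typ for col, typ in observed.items() if col not in expected}
    removed = {col: typ for col, typ in expected.items() if col not in observed}
    changed = {
        col: (expected[col], observed[col])
        for col in expected.keys() & observed.keys()
        if expected[col] != observed[col]
    }
    return SchemaDrift(added=added, removed=removed, changed=changed)


# Hypothetical example schemas (not taken from the project).
expected_schema = {"patient_id": "string", "birth_date": "date", "zip": "string"}
observed_schema = {"patient_id": "string", "birth_date": "string", "gender": "string"}

drift = detect_schema_drift(expected_schema, observed_schema)
print(drift.has_drift, drift.added, drift.removed, drift.changed)
```

In a pipeline, a check like this would typically run before loading a batch, so a renamed or retyped column raises an alert instead of silently corrupting downstream tables.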


enterprise-data-platform

Snowflake · dbt · Apache Airflow · GitHub Actions · SQL · Python

Full-stack enterprise data platform with medallion architecture (RAW → STAGING → MART), metadata-driven quality rule engine, and operational observability layer.

By the numbers: 40+ SQL scripts · 25+ dbt models · 65+ data quality rules · 6 governance schemas · CI/CD via GitHub Actions
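To make "metadata-driven quality rule engine" concrete, here is a minimal sketch of the pattern: rules live as data (for example, rows in a governance table), and a small dispatcher applies them. This is an assumption-laden illustration, not the platform's actual code; `CHECKS`, `RULES`, `run_rules`, and the sample columns are all hypothetical.

```python
from typing import Any, Callable, Dict, List

# Registry of reusable checks. Each takes a value and the rule's parameters.
CHECKS: Dict[str, Callable[[Any, dict], bool]] = {
    "not_null": lambda value, params: value is not None,
    "in_range": lambda value, params: (
        value is not None and params["min"] <= value <= params["max"]
    ),
    "in_set": lambda value, params: value in params["allowed"],
}

# Rules expressed as metadata. In a real platform these might be loaded
# from a table; here they are hard-coded hypothetical examples.
RULES: List[dict] = [
    {"rule_id": "R001", "column": "order_id", "check": "not_null", "params": {}},
    {"rule_id": "R002", "column": "amount", "check": "in_range",
     "params": {"min": 0, "max": 10000}},
    {"rule_id": "R003", "column": "status", "check": "in_set",
     "params": {"allowed": {"NEW", "SHIPPED", "CANCELLED"}}},
]


def run_rules(rows: List[dict], rules: List[dict]) -> Dict[str, int]:
    """Apply every rule to every row and count failures per rule_id."""
    failures = {rule["rule_id"]: 0 for rule in rules}
    for row in rows:
        for rule in rules:
            check = CHECKS[rule["check"]]
            if not check(row.get(rule["column"]), rule["params"]):
                failures[rule["rule_id"]] += 1
    return failures


sample_rows = [
    {"order_id": 1, "amount": 50, "status": "NEW"},
    {"order_id": None, "amount": -5, "status": "LOST"},
]
result = run_rules(sample_rows, RULES)
print(result)
```

The appeal of this design is that adding a rule means adding a metadata row rather than writing new code, which keeps the rule count easy to grow and audit.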


PostgreSQL · SQL · Data Analysis

Advanced SQL analytics: customer segmentation, revenue analysis, cohort queries, and business KPI reporting.


📈 Professional Impact

| Company | Role | Key Achievement |
| --- | --- | --- |
| Hexaware Technologies | Data Engineer – DQ & Governance | 200+ quality rules · 99.9% cross-system accuracy · MTTR ↓25% |
| Amazon | Data Operations Associate | 99%+ accuracy · 500+ incidents resolved |
| Mphasis | Data Specialist | Dataset accuracy ↑15% · 100K+ records profiled |

🎯 Currently Seeking

Full-time roles in the USA (OPT starting May 27, 2026) in:

  • Data Quality Engineer
  • Data Governance Analyst
  • Data Engineer
  • Analytics Engineer

📬 Let's connect: sreyaslankala@gmail.com · LinkedIn

Pinned

  1. enterprise-data-platform (Public)

    Enterprise data quality & governance platform: Snowflake · dbt · Airflow · medallion architecture · 65+ quality rules · metadata governance · observability

    Python