rohankumardubey/README.md

⚡ Distributed Systems / Big Data Engineer / Data Platform


๐ŸŒ Building Systems, One Node at a Time โ€” I'm Rohan

I'm a Distributed Systems Engineer / Big Data Engineer / Data Platform Engineer focused on designing and building scalable, fault-tolerant, high-performance distributed systems. I specialize in asynchronous and concurrent programming, low-latency architectures, and real-time data streaming and processing.

I have hands-on experience with Java, Python, Go, Rust, Scala, and C++, and I actively explore system design, performance optimization, and distributed data platforms. I'm passionate about open source and about contributing to the distributed systems ecosystem.

💡 Tech stack: Java, Python, Go, Rust, Scala, C++
Open-source enthusiast and system design explorer.


โš™๏ธ Distributed Systems & Data Engineering Stack ๐Ÿš€

  • 🧠 Compute & Processing: Apache Spark, Apache Flink, Ray, Apache Hadoop
  • 🌊 Streaming & Messaging: Apache Kafka, Pulsar, Kafka Streams, RabbitMQ, NATS
  • 🧮 Query & Analytics Engines: Presto, Trino, Dremio, ClickHouse, DuckDB, Arrow
  • 🗄️ Lakehouse & Storage: Apache Iceberg, Delta Lake, Hudi, MinIO
  • 🧱 Databases: Cassandra, ScyllaDB, CockroachDB, TiDB, MongoDB
  • 🧬 Data Platforms: Airbyte, Druid, Pinot, Drill, Snowflake, BigQuery
  • ☁️ Infra & Orchestration: Kubernetes, Docker, Terraform, Apache Airflow, Argo
  • ⚡ High-Performance Systems: Rust, gRPC, WASM, Vectorized Execution

📊 GitHub Stats


🔥 Contribution Streak


Connect With Me


📈 Activity Graph

Pinned

  1. flink Public

    Forked from apache/flink

    Apache Flink

    Java

  2. FlowCore Public

    FlowCore is a lightweight, Rust-powered real-time stream processing engine inspired by Apache Flink. It supports event-time processing, tumbling windows, watermarks, late-event handling, and checkp…

    Rust 1

  3. spark Public

    Forked from apache/spark

    Apache Spark - A unified analytics engine for large-scale data processing

    Scala 2

  4. ClickHouse Public

    Forked from ClickHouse/ClickHouse

    ClickHouse® is a real-time analytics database management system

    C++

  5. tidb Public

    Forked from pingcap/tidb

    TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

    Go

  6. DataWizz Public

    DataWizz is a data platform that combines file ingestion, SQL exploration, Delta Lake publishing, multi-engine notebooks, low-code orchestration, and business dashboards in one local-first workspace.

    TypeScript 1