Data Engineering

Things I learn,
build, and break.

Honest writeups on pipelines, distributed systems, and cloud infrastructure.

Posts

Medallion Architecture: Why the Layers Actually Matter

Scaling a Real-Time IoT Pipeline to 4 Lakh Concurrent Connections

Implementing CDC in PySpark Without a Framework

GCP Data Engineering Stack: What Each Service Actually Does

Airflow DAG Patterns That Don't Break in Production