Designed and implemented real-time and batch ETL pipelines and external-facing APIs, improving mission-critical data delivery for organizations including NASA, NOAA, the Coast Guard, and the Navy.
Designed and scaled a cloud-based data lake on Amazon S3 with Presto/Trino, enabling ML teams to efficiently query and analyze petabyte-scale datasets for model training.
Implemented large-scale data processing and analytics using Spark/Databricks, reducing processing times from days to hours.
Designed and optimized database solutions to support both operational and analytical workloads.