Beaverton, Oregon, United States
Project: Financial Data Platform Modernization
• Led migration of legacy Airflow/EMR pipelines to Databricks, improving scalability and auditability,
resulting in a 15% cost reduction.
• Designed and implemented PySpark ETL pipelines across raw, cleansed, and curated layers with
full lineage and version control, cutting future development effort to half.
• Orchestrated data optimization by applying Liquid Clustering on financial datasets for regulatory
reporting, shrinking storage costs by 30% and improving query performance by 4x.
• Established a robust data governance framework around Snowflake, ensuring 99.9% data accuracy
for regulatory compliance and enabling 75+ users to access insights via Cognos under IAM controls.
• Led an engineering squad of 35 engineers, overseeing sprint planning, risk management,
stakeholder alignment, and production support, which reduced defects by 30%.
• Managed the migration of 2+ years of legacy financial data from S3 to Delta Lake with full
compliance and decommissioned outdated EMR workflows with zero business disruption.