San Francisco, California, United States
• Led the overhaul of the staging environment for 100+ microservices across 30+ teams, increasing deployment frequency by 35%, reducing lead times by 25%, and cutting customer-facing incidents by 40%.
• Mobilized a 20% capacity commitment from teams through a data-driven business case that positioned internal tools as a core innovation enabler.
• Piloted ephemeral developer environments to achieve a 5x reduction in setup times, later transitioning to a persistent, mock-data solution to improve workflow consistency.
• Developed a framework leveraging DORA metrics to measure development and maintenance overhead, enabling teams to prioritize work effectively and estimate impact. This framework supported infrastructure migrations (Kops → AWS EKS, custom secrets management → HashiCorp Vault, CloudKarafka → Confluent (Kafka)) that improved maintenance rates and reduced operational overhead.
• Optimized monitoring and alerting by negotiating PagerDuty contracts and piloting OpsGenie, securing more favorable pricing for our DataDog-based monitoring setup.
• Directed quarterly planning and vision casting for 8 teams (SRE, Infra, DBRE, Core Services, API Platform, UI Platform, Developer Productivity, Quality Productivity) by collaborating with engineering managers, directors, VPs, and our SVP to execute a predictable annual roadmap.