•Architected and delivered a scalable dynamic endpoint-discovery component for the ping monitoring
platform, enabling robust, high-availability health checks across all Workday customer tenants endpoints.
•Drove the platform’s multi-region rollout, coordinating deployment, configuration, and validation
to establish a second AWS region and improve global observability accuracy by removing cross-
region network noises.
•Architected and built Workday’s internal service-ping infrastructure, establishing a Tekton-based
self-service onboarding pipeline that streamlined adoption across teams.
•Led planning, PoC, and delivery of the Fed IL4 of the ping service. Collaborated cross-
functionally with Security, Compliance, and Federal teams to meet federal requirements and elevate
platform security posture.
•Designed, implemented, and deployed a Fluent Bit–based logging pipeline to ingest service logs
into Kibana, establishing the standard logging pattern and delivering cross-team knowledge transfer
adopted across the Observability organization.
•Resolved core conflicts between Watchdog and the customized Prometheus Operator, enabling
reliable platform-level health monitoring for the ping service and improving system stability.
•Designed and delivered a new onboarding workflow for newly acquired Workday services, enabling
them to integrate with the ping platform for tenant-level SLA calculation.
•Refined SRE alerting logic to eliminate false signals caused by manual operations during major
incidents, improving alert accuracy and ensuring more reliable MTTR pipeline calculations.
•On-call rotation responsibility for diagnosing false alerts and maintaining real-time service health
dashboard accuracy.