Develop and maintain a service that monitors AWS quotas and automatically submits increase requests across Salesforce Hyperforce, ensuring uninterrupted scalability for global customers
•
Built and operate a deployment risk API used by Salesforce’s deployment service to determine whether production rollouts can proceed safely, reducing risk of service-impacting changes
•
Provide technical support through internal channels, debugging complex infrastructure issues, guiding engineers, and expanding service coverage to GovCloud and GCP
Served as Incident Commander during high-severity outages, coordinating response across teams and reducing mean time to recovery (MTTR) for mission-critical services
•
Designed and implemented dashboards for key services to improve operational visibility, enabling faster diagnosis of availability and latency issues
•
Led post-incident reviews and introduced preventative measures to reduce recurrence of high-impact incidents
Created a CLI-based support chatbot for internal build tools, leveraging NLP techniques to parse developer queries and recommend relevant documentation
•
Applied deep learning models to analyze logs, automatically diagnose common failures, and suggest recovery actions