I’m a platform and backend engineer focused on building scalable cloud systems that teams can operate with confidence.
Experience
2024 — Now
Sunnyvale, California, United States
I own and continuously improve the Log, Metrics, and Monitoring platform for Amazon’s device organization, delivering observability for millions of devices and supporting thousands of developers and QA workflows daily. My focus is building scalable telemetry infrastructure (logs + metrics), reliable querying and dashboards, and monitoring/alerting systems that improve fleet reliability and accelerate incident response.
Key contributions include scaling the end-to-end log and metrics pipeline (ingestion → storage/indexing → query → dashboards) to make debugging faster and more dependable, and designing real-time device health monitoring and alerting with fleet-wide automation for operational tasks. I also architected an AI-assisted log triage system using Amazon Bedrock, RAG, and Claude, integrated with Slack, internal web portals, and paging workflows; it was adopted org-wide and reduced MTTR for critical device incidents. In addition, I led a multi-service migration to a shift-left deployment strategy from the DUB region, improving release velocity and reducing production incidents across the organization.
2024 — 2024
Sunnyvale, California, United States
internal systems. My work spans backend services, cloud infrastructure, and CI/CD automation, with an emphasis on operational excellence (monitoring, alerting, incident response) and scalable architecture.
Key contributions include improving telemetry pipelines and dashboards for faster root-cause analysis; designing resilient service patterns and deployment workflows to reduce production incidents; and partnering across teams to drive standards for reliability, automation, and cost/performance efficiency across shared infrastructure.
2021 — 2023
At Instawork, I led payments and platform engineering and drove multiple high-impact initiatives across product, infrastructure, and risk. I designed and launched a fully automated payments system (including Instawork debit cards) that improved payout efficiency and significantly reduced manual operations and support overhead. In parallel, I spearheaded an infrastructure transformation on AWS by establishing Infrastructure-as-Code practices with Terraform—building the foundations, team practices, and delivery standards from the ground up. I also led Trust & Safety efforts across multiple projects to strengthen payment integrity and ensure secure, on-time processing for customers at scale.
2014 — 2021
Led architecture and delivery of a real-time writing platform on AWS using event-driven patterns. Built scalable APIs and backend services that enabled seamless third-party integrations and supported millions of users with high availability and low latency.
Modernized billing and payments across both real-time and subscription workflows. Strengthened risk controls and automation, improving reliability end-to-end and reducing payment fraud to near zero.
Built and scaled engineering teams from the ground up. Set technical direction, mentored engineers, established delivery and quality standards, and drove execution across multiple product lines in partnership with cross-functional stakeholders.
2007 — 2014
Earlier in my career, I worked across backend platforms, performance engineering, and automation at Qualcomm, Samsung Electronics, and HCL Technologies. I built performance benchmarking frameworks for WebKit/Chromium/V8, and developed automation and interoperability testing infrastructure across Android, mobile, and web platforms. I also built backend APIs for payments and search in a high-scale product environment.
Education
ITM Dehradun