California, United States
• Built and developed a scalable next-gen telemetry pipeline that collects hundreds of metrics from network devices across roblox network using gNMI, Netconf and REST
• Designed, tested, and deployed a three node kafka cluster for telemetry pipeline to ensure high availability and scalability of network data
• Led the migration of postgres databases of our applications from on-prem to AWS RDS and enabled automated failover capabilities, monitoring, automated configs using terraform, and better security.
• Led validation of new vendors into the global roblox network for its compatibility with our tools and the network
• Led the team to improve the workflow of asset management lifecycle of company's infrastructure assets using service now and assetvue
• Deployed RPKI for our backbone network to secure roblox infrastructure from malicious BGP routing hijacks and leaks