# Vedant Mhatre

> Infra @ Rox | Prev. ByteDance

Location: San Francisco Bay Area, United States

Profile: https://flows.cv/vedantmhatre

It started in 2014 with a Linux box and a curiosity for how things worked under the hood. Twelve years later, that curiosity has evolved into a career engineering distributed systems at massive scale. Most recently as an SRE at TikTok/ByteDance, I built custom tooling in Go and Python to manage complex multi-cloud environments across AWS, OCI, Azure, and GCP.

My focus isn't just about "keeping the lights on." I believe in engineering self-healing systems and automated governance tools that eliminate configuration drift and reduce cloud spend. In high-velocity environments, I prioritize building the "0 to 1" infrastructure that allows a team to scale without the friction of manual operations.

Core Focus:

- Platform Engineering: Building self-service developer platforms and CLI tools that standardize workflows and empower engineers to ship faster.
- Infrastructure Automation: Replacing manual ops with event-driven microservices and intelligent automation to ensure 99.9% reliability.
- Observability & FinOps: Architecting analytics dashboards that turn raw logs into actionable insights, specifically focused on optimizing cloud burn rate.

## Work Experience

### Software Engineer (Infra) @ Rox

Jan 2026 – Present | San Francisco Bay Area

### Site Reliability Engineer @ TikTok

Jan 2025 – Jan 2025 | San Jose, California, United States

### Site Reliability Engineer @ ByteDance

Jan 2025 – Jan 2025 | San Jose, California, United States

- Multi-Cloud Image Validation Service: Developed a Go microservice for comparing and validating machine images across AWS, OCI, Azure, and GCP. Leveraged goroutines for parallel data collection and Terraform for automated deployment.
- Infrastructure Analytics Dashboard: Architected a Python-based analytics dashboard to consume and visualize multi-cloud drift data, providing engineers with self-serve insights for system upgrades and cleanup.
- Chatbot for Infrastructure Cleanup: Built a Lark chatbot that automates drift remediation by fetching candidates, prompting owners for approval, and de-provisioning resources to reduce costs.

### Software Engineer (Graduate Teaching Assistant) @ Virginia Tech

Jan 2024 – Jan 2025 | Arlington, Virginia, United States

- Real-Time Data & Analytics Platform: Architected a system to collect live GPS streams for real-time construction safety analytics. Designed the backend logic to process location data and power dynamic heatmap visualizations, delivering actionable hazard insights via a cross-platform application.

### Software Engineer (Research Assistant) @ Virginia Tech

Jan 2024 – Jan 2024

### DevOps Engineer @ Hiver

Jan 2022 – Jan 2023

- Internal Developer Platform (IDP): Engineered a remote development environment to replace AWS Cloud9, enabling engineers to code directly in Kubernetes. Built a Node.js CLI wrapper for DevSpace and Helm to standardize workflows, boosting developer productivity by 25%.
- Automated QA Infrastructure: Maintained and improved an ephemeral testing platform that provisions isolated Kubernetes namespaces for every git feature branch. Automated the deployment of microservices using Node.js, enabling parallel QA testing and reducing release cycle time.
- Kubernetes & Orchestration: Managed 650+ microservices across 6 EKS clusters using Helm and ArgoCD, ensuring high availability and seamless auto-scaling for production workloads.
- Infrastructure as Code (IaC): Automated AWS resource provisioning and configuration management using Terraform and Ansible, eliminating configuration drift and enforcing infrastructure standards.
- Observability & Monitoring: Established a robust monitoring stack with Datadog, Prometheus, and Grafana, cutting mean time to detect (MTTD) by 45%. Deployed custom metric exporters to improve system visibility.
- High-Scale Logging: Maintained a 30TB+ logging pipeline using Elasticsearch, Logstash, Fluent Bit, and Kibana (ELK). Optimized sharding and indexing to lower query times by 60% during incident response.
- CI/CD: Streamlined deployment pipelines using Python, Bash automation scripts, and GitHub Actions, reducing deployment overhead and ensuring reliable releases.
- Global Traffic Management: Optimized multi-region load balancing (Layer 4/7) to reduce API latency by 20% and handle sudden traffic spikes without service disruption.
- Linux Systems Administration: Managed a fleet of 100+ Linux servers and their provisioning, reducing manual operations by 50% and ensuring consistent system performance.
- Cloud Cost Optimization: Reduced Amazon EKS and EC2 costs by $120,000 annually (10%) through automated rightsizing and Spot Instance usage.

### DevOps Engineer @ Hiver

Jan 2022 – Jan 2022

### Cloud Engineer @ Sapio Analytics

Jan 2021 – Jan 2021

Revamped AWS cloud operations through automation, serverless migration, and enhanced security, achieving significant efficiency gains and cost savings.

### Web Developer @ A. P. Shah Institute of Technology, Thane

Jan 2019 – Jan 2020

Optimized college recruitment processes by developing an integrated portal and interactive dashboard, streamlining job postings and student applications.

## Education

### Master's degree in Computer Science

Virginia Tech

### Bachelor's degree in Computer Engineering

University of Mumbai

## Contact & Social

- LinkedIn: https://linkedin.com/in/vedant-mhatre
- Portfolio: https://vmhatre.com/

---

Source: https://flows.cv/vedantmhatre

JSON Resume: https://flows.cv/vedantmhatre/resume.json

Last updated: 2026-04-10