•Launched and improved data pipelines for generating video, scanner, telemetry, and metadata files from 100s+ robot stations, uploading 100s TB+ to S3, processing into Robot Data Platform (RDP) via SQS, Kubernetes, Python services on AWS + Terraform
•Built RDP with Django, PostgreSQL (RDS) on AWS (EKS), handling 1000+ requests/sec peak for AI researchers, robot app developers, and deployment engineers to access, filter, and modify data via web UIs and Python client
•Designed annotation pipelines using RDP, S3, SQS, Lambda functions to send images and videos to vendors so the enriched data can be used for machine learning model training and robot performance metrics
•Full list of tech used: Python, Django, pytest/mypy/black, Kubernetes, docker/docker compose, SaltStack, PostgreSQL, Terraform, AWS (EKS, ECR, RDS, SQS, SNS, S3), Datadog (metrics monitoring), Github Actions (CI), Bazel (build system), gRPC/protocol buffers, HTTP, PyTorch, Ray, SkyPilot