Leading technical initiatives for production ML data infrastructure supporting agricultural robotics and autonomous systems.
Technical Leadership - Production ML Data Curation Platform:
• Led end-to-end architecture and implementation of image curation pipeline processing millions of images, integrating online calibration (OCAL), stereo rectification, and distributed inference systems using Databricks and Apache Airflow
• Drove cross-functional collaboration with CVML, Infrastructure, and Safety teams to align inference serving split architecture, resource governance, and complex multi-team technical dependencies
• Achieved 40% cost reduction while increasing throughput by optimizing GPU-to-CPU rectification, implementing dynamic batch sizing, and establishing cost analytics framework
• Established production reliability standards through RCA resolution, robust logging/monitoring infrastructure, and John Deere compliance data governance
Strategic Platform Modernization & Infrastructure Leadership:
• Led strategic platform migration from Kubeflow to Databricks with Airflow orchestration, designing modular architecture and coordinating multi-quarter roadmap
• Architected multi-threaded data ingestion pipeline for HALO camera data and MCAP bag extraction, delivering scalable foundation for autonomy state transition analytics
• Established hybrid cloud infrastructure collaborating with infrastructure team on distributed compute orchestration and cost optimization
• nDrove security initiatives including automated vulnerability scanning, ECR lifecycle management, and IAM policy governance
Technical Mentorship & Team Impact:
• Provided technical guidance to CVML engineers on Databricks ML pipeline, sharing expertise in distributed systems and Spark optimization
• Established code standards and development frameworks adopted across broader BRT organization
• Championed knowledge sharing through technical presentations, design reviews, and cross-team collaboration