# Gopu Nair

> Staff Engineer | AI Engineering | AI systems | AI-ML Platforms | Software Engineering

Location: Cupertino, California, United States
Profile: https://flows.cv/gopu

Staff software engineer experienced in building AI platforms and systems at the scale of eCommerce platforms. Most recent and salient impact:

Building and managing highly scalable, distributed model training platforms for eBay, with features on par with modern ML platforms:

- A polyglot platform that runs model training for users in their language of choice, with custom and distributed training features
- Features for platform robustness: DAG execution, workflow features and experiments, configurable compute capacity, easy-to-access logs and metrics
- State management using Apache ZooKeeper
- Connectivity to various data marts (big data platforms) and feature stores, including hot and cold data storage features
- State-of-the-art experiment management system and model management system with labelling, versioning, etc.
- Hyperparameter tuning, HP metrics collection and plotting
- Training/experiment metric collection (time series) and dashboarding using Prometheus and Grafana to measure and provide visibility into various training metrics and compute/memory optimization

Polyglot software engineering with Java, Scala, Golang and Python.

Designing and building highly efficient, highly available Near Real Time (NRT) data pipelines for feature stores used in model training for computer vision features such as visual search, vector similarity search and moderation of user-uploaded content (video, image and text):

- Distributed event-driven systems built using Apache Kafka
- Event-driven producers and consumers built using Spring Cloud Stream, Apache Flink, etc.
- Built to scale: systems scaled to produce and process billions of NRT events per day (the scale of one of the largest eCommerce platforms in the world) with negligible processing lag and optional compute capacity
- High efficiency: employed data aggregation techniques (Flink) and non-blocking reactive programming (Reactor) to achieve high efficiency and processing speed

Extensive experience in business transformation, systems integration and customized solution development across all phases of the software development life cycle, with 2 of the US Big 5 consulting firms:

- Working with public and private sectors
- Healthcare, finance and pension system sector experience

Overall: over 15 years of experience in designing and building high-scale distributed systems using the Java suite, Golang, the Spring suite, Oracle, NoSQL databases, NRT systems, the Hadoop stack, Kubernetes, etc. A strong engineering professional with good interpersonal skills.

## Work Experience

### Staff Software Engineer @ Walmart Global Tech

Jan 2023 – Present

### Member Of Technical Staff @ eBay

Jan 2018 – Jan 2023 | San Jose, California

Krylov - ML Platform: Part of the team building and managing Krylov, a highly scalable, distributed in-house AI/ML model training platform with features on par with modern ML platforms such as Google’s AI Platform or Microsoft’s Azure.
- Features for platform robustness: DAG execution of tasks, workflow features, configurable retries, configurable compute capacity, easy-to-access logs and metrics
- State management using Apache ZooKeeper, managed by the team
- Connectivity to the Hadoop platform and feature store
- Custom workload and platform metric creation and collection using Kubernetes libraries and Nvidia agents (GPU metrics)
- Training/experiment metric collection and dashboarding using Prometheus and Grafana

Near Real Time data processing and data pipelines: Designed and built highly efficient Near Real Time (NRT) data pipelines for feature stores and big data behind computer vision features such as visual search, vector similarity search and moderation of user-uploaded content (video, image and text).

- Distributed event-driven systems built using Apache Kafka, based on the eBay platform for data streaming
- Built to scale: systems scaled to produce and process billions of NRT events per day (the scale of eBay listings and text/image/video uploads) with negligible processing lag and optional compute capacity
- Efficiency: employed data aggregation techniques (Flink) and non-blocking reactive programming to achieve high efficiency and processing speed

BigData Platform: Built and maintained eBay's in-house Hadoop platform. eBay has world-class Hadoop clusters with more than 3000 nodes and petabytes of data. Features include Spark, Hive, HBase, data governance tools, data intelligence tools, job scheduling and tracking tools, etc.

Data platform: Part of the team that maintains and enriches eBay's data access layer (DAL), a highly available system that manages eBay's database infrastructure and is responsible for site availability.

### Sr Engineer @ eBay

Jan 2015 – Jan 2018 | San Jose

### Technical Lead @ Gap Inc.
Jan 2013 – Jan 2015 | San Francisco, California, United States

- Leading a team building Java-based systems for retail inventory management, supply chain warehouse management, logistics, invoicing and billing
- Responsible for design, development, deployment to production, production performance and application monitoring
- Full-stack development experience, from provisioning VMs and writing Chef cookbooks to design, development and deployment of applications in a TDD and pair programming environment
- Test-driven development in an Agile scrum team with pair programming

### Technology Consultant @ Deloitte

Jan 2011 – Jan 2013

Part of the successful migration of California's EDD from legacy systems to a modern web system with zero disruption to customer experience.

Completion of health care reform related changes for 2 of the leading health insurance providers in the US.

- Leading efforts for end-to-end solutions, from requirement gathering through solution design and implementation
- Design and development of new web-based systems, integration of newly developed systems with legacy systems, and creation of interfaces
- Experience working in Agile and Waterfall methodologies
- Implementation of web MVC, web services, SQL databases, Spring services, SOA, JSP, JSTL and JavaScript
- Experience working with all phases of the SDLC
- Team leading and Joint Application Design experience
- Experience working with geographically distributed teams, including offshore teams

### Sr Software Engineer @ Accenture

Jan 2005 – Jan 2011

Part of the California Pension System Reform (PSR) delivery for the largest public sector pension fund in the world. This was the fifth, and finally successful, attempt to build a modern web-based system to integrate various disconnected systems.
Part of a single sign-on system implementation using modern web technologies to integrate various parts of the health insurance business of a major US health insurer, as an overhaul and modernization of the fragmented systems used to manage that business.

- Design and development of large-scale, multi-tier web-based systems
- Full-stack development projects using Java and the related technology stack
- Technical implementation of business solutions in various industries using Java and related technologies
- Hands-on design and development of UI and controller components with the Java Spring Framework, JSP, JSTL and JavaScript
- Experience creating the DB layer of applications using the Spring Framework
- Extensive experience with RDBMS (Oracle), including creation of complex queries and procedures
- Experienced in customizing and deploying applications on web/application servers such as Oracle WebLogic Application Server, IBM WebSphere Application Server and IBM WebSphere Portal Server

## Education

### Computer Science

University of Calicut

## Contact & Social

- LinkedIn: https://linkedin.com/in/gopunair

---

Source: https://flows.cv/gopu
JSON Resume: https://flows.cv/gopu/resume.json
Last updated: 2026-04-12