# Gurmeet Singh

> Staff Software Engineer

Location: Sunnyvale, California, United States
Profile: https://flows.cv/gurmeetsingh

Technical leader in open-format data lake technologies with in-depth experience in data analytics and storage. Leading KPI-based cross-team projects driving storage and compute efficiency for systems serving millions of queries per day and petabytes of data. Experience building systems with open source technologies such as Kafka, Pinot, Presto, Hudi, Hive, HDFS, M3, Grafana, and Looker Studio.

## Work Experience

### Staff Software Engineer @ Uber

Jan 2020 – Present | Sunnyvale, California, United States

Working on interactive data analytics at Uber using Presto. Focus on data and query optimization.
Led the Presto Query Governance project at Uber, using real-time insights to schedule analytics workloads.
Led a data layout formatting project across the Hudi and Presto teams to reformat (Parquet) data in order to make queries more efficient. Reduced data read by 25 PB/week.
Co-led a query result caching project across the Business Intelligence (BI) and Presto teams to serve queries using cached results.
Architected and drove the Presto express project to make the majority of Presto queries faster using just a fraction of the resources.
Designed and architected a data sampling project to automatically rewrite Presto queries to use sampled data.

### Software Engineer @ Facebook

Jan 2017 – Jan 2019 | Menlo Park

- Feature engineering, rules engine, machine learning, and enforcement for payments risk engineering on the FB platform

### Senior Software Engineer @ Robin Systems

Jan 2013 – Jan 2017

- One of the founding engineers in the company
- Implemented synchronous replication in a logical volume manager specially designed for container-based storage. Designed and implemented both the control plane and data plane protocols. Implemented resynchronization after failure.
(C)
- Implemented a data lake by federating multiple HDFS namenodes (Java/C++)
- Implemented a read cache on top of HDFS for performance (Java)

### Principal Engineer @ Arkologic

Jan 2012 – Jan 2013

Architecture, design, and implementation of a storage virtualization subsystem to abstract the storage from the file system layer.
Separation of components between the kernel and userspace for robustness and fault tolerance.
Development of a resource manager in user space for managing disk resources.
Design of in-place upgrade of the file system from the physical to the virtual storage layer.
Development of libraries for memory allocation management and a fault injection framework.
REST-based interface for client programming.
Smoke, nightly, and regression test strategy development.
Continuous and torture test strategy development.
C/Python-based development.

### Member of Technical Staff @ Network Appliance

Jan 2008 – Jan 2012

- Design and development of data protection and disaster recovery solutions for primary storage.
- Synchronous and asynchronous replication of file systems.
- Design and development of logical replication with storage efficiency; preserving on the wire data duplication from primary to secondary.
- Data-preserving in-volume snapshot restore.
- Performance tuning and optimization in LAN and WAN environments.

### Graduate Research Assistant @ Information Sciences Institute

Jan 2002 – Jan 2007

Research on resource management (scheduling of tasks) in high performance computing systems.

ACADEMIC PROJECTS

Pegasus: Planning for Execution in Grids, http://pegasus.isi.edu, (Team Member) 2002-2007

- Conducted research in resource provisioning techniques for future generation workflow systems.
- Developed web-based tools for management and execution of workflows over distributed systems using the Pegasus workflow management system.

Metadata Catalog Service (MCS), http://mcs.isi.edu, (Chief Developer, Architect) 2003-2005

- Developed the relational database schema, API, WSDL, and service-side implementation for the catalog.
- Developed a web application as a GUI frontend for the catalog.
- Developed a test harness and conducted performance testing on MCS in a distributed environment.
- Modified the OGSA-DAI query engine to interface with the MCS database.
- Developed an XML-based interface for MCS queries and responses.

Virtual Grid Application Development Software (VGrADS), vgrads.rice.edu, (Member) 2005-2007

- Conducted research in economics-based resource management heuristics for distributed systems.
- Conducted research in multi-criteria resource provisioning and scheduling for workflows in distributed systems.

National Virtual Observatory (NVO), http://www.us-vo.org, (Student Member) 2002-2005

- Involved in parallelization and Grid enablement of astronomical applications using Pegasus.

## Education

### PhD in Computer Science

University of Southern California

### Bachelor's Degree in Electronics Engineering

Motilal Nehru National Institute Of Technology

## Contact & Social

- LinkedIn: https://linkedin.com/in/gurmeetsingh123
- Portfolio: https://sites.google.com/site/sgurmeet/

---

Source: https://flows.cv/gurmeetsingh
JSON Resume: https://flows.cv/gurmeetsingh/resume.json
Last updated: 2026-04-12