# Sumit Lamba > Staff Engineer | Distributed Data Systems | Spark, Kafka, Airflow, Snowflake | Real-time & Batch Pipelines | AWS | ETL Pipelines | Data Infrastructure Location: San Francisco Bay Area, United States Profile: https://flows.cv/sumitlamba Staff Software Engineer specializing in Data Platforms and large-scale data infrastructure. I have 10+ years of experience building distributed data systems, scalable ETL pipelines, and event-driven architectures that power analytics and business decision-making. My core expertise includes batch and streaming data pipelines, data modeling, data warehousing, and cloud-native infrastructure on AWS. I enjoy solving complex data engineering problems involving distributed processing, performance optimization, and data reliability. Technologies: Python, Spark, Airflow, Kafka, dbt, Snowflake, SQL, Java, AWS (S3, EC2, EKS), Docker, Terraform, and event-driven microservices. ## Work Experience ### Staff Software Engineer - Data @ SoFi Jan 2023 – Present Architected event-driven data pipelines for the advertising attribution platform using CDC ingestion, Kafka event streams, and Flink stream processing, delivering analytics-ready datasets into Snowflake for real-time campaign performance insights. ### Senior Data Engineer @ SurveyMonkey Jan 2020 – Jan 2023 ### Software Developer @ realtor.com Jan 2016 – Jan 2020 | San Francisco Bay Area ### Research Assistant @ The Research Foundation for SUNY Jan 2015 – Jan 2016 • Sentiment Analysis of 6000+ tweets and mails to identify wedge-driven sentiment from Boston Bombing data • Content Analysis and data coding of 3000+ Phishing mails to decipher risk communication patterns • Running statistical analysis using regression model on the filtered data to understand the behaviour regarding different phishing mails. ### Senior Software Engineer @ Infosys Jan 2013 – Jan 2015 | Chandigarh Area, India • Interacted with mutiple stakeholders to gather and analyse end user requirements • Developed SQL queries, joins and procedures to extract, transform and validate between source and target in Oracle database •Designed test strategy, test plan and test cases using HPQC for system testing, regression testing and System integration testing. •Identified and removed unwanted data, optimized queries and procedure to improve system performance •Coordinated with multiple stakeholders (client, development and support teams) and resolved conflicts and open items on daily basis. • Led team of four associates in finance modules of data migration requirement. • As a SME(subject matter expert) for Financial Reconciliation activity, audited test techniques implemented across various modules. •Performed end-to-end test management and defect management. Key Competencies and skills learned - Team Management - Customer Management - Business analysis - Software Development Life Cycle - Knowledge Transfer Process - Quality Assurance ### Software Engineer @ Infosys Jan 2010 – Jan 2013 •Developed Selenium test scripts for account creation, web navigation & form submission functionalities •Connected Selenium to MySQL, performed joins & sub queries to combine tables & processed result-sets validating against UI •Designed cross browser test scripts to validate JavaScript code in multiple browsers reducing manual work for offshore team •Developed Excel Macros to automate test planning steps reducing manual effort ## Education ### Master's degree in Management Information Systems University at Buffalo School of Management, The State University of New York ## Contact & Social - LinkedIn: https://linkedin.com/in/sumit-lamba --- Source: https://flows.cv/sumitlamba JSON Resume: https://flows.cv/sumitlamba/resume.json Last updated: 2026-04-12