# Sherin Thomas

> Streaming | Data Infra | Climate Change Research

Location: San Francisco, California, United States

Profile: https://flows.cv/sherin

My superpower is taking an abstract problem and building a 0 -> 1 solution. I have extensive experience in Data Infrastructure (particularly Streaming), Distributed Systems, and Governance.

## Work Experience

### Senior Staff Software Engineer @ Slack

Jan 2024 – Present | San Francisco, California, United States

I help build the data infrastructure and ingestion systems that power Slack's data warehouse, query engines, and data lake. I also lead data governance and streaming efforts.

### Staff Software Engineer @ Chime

Jan 2021 – Jan 2024 | San Francisco, California, United States

Doing a few different things in this role:

1. Building out the Streaming Platform from scratch. I've written about this here: https://medium.com/p/4c3ee2568a76
2. Leading the Data Lake effort
3. Leading the Data Discovery and Governance effort for all classes of data at Chime

### Technical Advisor @ SpaceML

Jan 2020 – Jan 2022 | San Francisco Bay Area

SpaceML is an offshoot of NASA's Frontier Development Lab, focussed on solving the problem of climate change and facilitating space research with AI at scale. Started by a group of citizen scientists and industry professionals such as myself, we are solving the problem of automatically detecting important weather phenomena such as hurricanes, wildfires, polar vortexes, melting ice caps, and others from petabytes of unlabelled data.

I lead the team that is productionizing a self-supervised reverse image search, built with deep learning, over imagery of Earth collected by NASA satellites. The goal is to eventually make this product open source and usable with any kind of dataset, not only earth science data.

As technical advisor and lead, I'm helping design the overall system and providing technical guidance to a group of citizen scientists.

### Senior Software Engineer, Realtime Data Infrastructure @ Netflix

Jan 2020 – Jan 2021 | San Francisco, California, United States

I work in the Realtime Data Infrastructure team at Netflix. In this role I'm helping build the next-generation, self-service data movement and processing platform using Kafka, Flink, Mantis, and Iceberg. Currently focussed on pioneering the next-generation, engine-agnostic SQL abstraction over Flink, Mantis, and Kafka. Also working to improve user confidence in the platform through reliability improvements, data quality audit frameworks, end-to-end latency tracking, anomaly detection, error attribution, etc.

### Senior Software Engineer, Streaming Platform @ Lyft

Jan 2017 – Jan 2020 | San Francisco, California, United States

I work in the Streaming Platform team at Lyft with a focus on improving Machine Learning through the use of realtime streaming data. With the hope of making streaming generally accessible across the company for all kinds of use cases, I have been building a self-service platform that makes it easy for users to specify complex aggregations on streaming data declaratively. The platform takes care of all the heavy lifting behind the scenes (completely abstracted away from the user), such as data discovery, resource provisioning, scale up, scale down, bootstrapping, and schema management. The platform is currently used mainly for Machine Learning feature generation, but several use cases also leverage its event-driven programming capabilities.

I have presented my work at several conferences, including QCon, Flink Forward, Beam Summit, Women Who Code, and Scale by the Bay. Scheduled to speak at Strata New York in September (although because of COVID-19, the conference may be conducted virtually).

### Senior Software Engineer @ Twitter

Jan 2015 – Jan 2017 | San Francisco, California, United States

In my 3 years at Twitter I worked on various parts of the Advertising Platform as well as the Direct Messaging promoted product.
My work touched all parts of the tech stack as well as several Twitter advertising products such as Ads Analytics, products for small and medium businesses, and direct messaging APIs for building user interactions, analytics, and bots.

### Software Engineer @ Google

Jan 2013 – Jan 2015 | San Francisco, California, United States

I worked on the Data Center system monitoring and visualization product at Google. This product collected system metrics from power and cooling equipment in Google's data centers and persisted them for running aggregations, visualizations, and alerting. My work focussed more on the aggregation and alerting part of the product. During my time at Google I also worked on building a knowledge graph over data center equipment and hierarchy for easy contextual querying and aggregation of metrics.

### Member of Technical Staff @ VMware

Jan 2012 – Jan 2013 | Palo Alto, CA

Services and framework for test automation and performance benchmarking.

### Student @ University of Florida

Jan 2010 – Jan 2012

Master of Science in Computer Science

GPA: 3.78/4.0

### Student Programmer, IT Services @ University of Florida

Jan 2011 – Jan 2012

Part-time student programmer for the Housing Management System at University of Florida.

### MTS Intern @ VMware Inc

Jan 2011 – Jan 2011 | Palo Alto

Framework for test bed setup and a performance-benchmarking tool, with the main goal of automating the testing of all core operations of VMware Fusion (hosted virtualization tool for Mac).

## Education

### MS in Computer Science

University of Florida

## Contact & Social

- LinkedIn: https://linkedin.com/in/thomassherin
- Website: http://www.corexprts.com
- Website: http://www.tcs.com

---

Source: https://flows.cv/sherin

JSON Resume: https://flows.cv/sherin/resume.json

Last updated: 2026-04-01