# Alok Subbarao > Founding Forward Deployed Engineer at Vori Location: Palo Alto, California, United States Profile: https://flows.cv/aloksubbarao Versatile data professional with expertise in Data Engineering, Analytics Engineering, MLOps/AI, ETL/ELT pipelining, orchestration, and cloud warehouse & distributed systems architecture. Proven track record at Meta with 4+ years implementing advanced AI/ML data solutions at web scale. I have a diverse career background across FAANG companies, tech startups, and Healthcare, Pharma, and Medical Device industries. My professional experience encompasses Product Analytics/Data Science, Dashboarding/Visualization, Machine Learning, with a personal passion for efficiency, cost reduction and query optimization. I implemented the full ML development cycle to launch production models for Meta's Integrity (Trust and Safety) unit to combat Account Compromise and revenue leakage on Meta's Ads and Business platform. I launched fraud detection models by developing ground truth test/training data and transliterating algorithm prototypes into robust ETL processes. I completed the ML dev cycle by implementing operational monitoring of model output via pipelines to track precision and recall, measuring temporal performance of model variants, and finally shipped feature engineered datasets for iterative enhancement for detection models operating on a scale of >1B entities on the Meta Business Graph. My domain knowledge areas of expertise include fraud detection on Meta-scale data (1B+ advertisers, 500M+ businesses), as well as expertise in the Pharmaceutical and Medical Device sectors, including patient claims, FDA regulations, and healthcare data management. My passion involves all forms of data optimization — validation, performance profiling, cost reduction, query optimization and ETL rearchitecture. I bring deep knowledge of various Cloud Data Warehouses (Hive, Snowflake, AWS Redshift, GCP BigQuery), columnar and NoSQL query engines (Presto, Spark, Cassandra, Redis, Aurora), and relational databases (MySQL, Postgres, OracleDB, MS SQL Server), and other areas such as orchestration (Airflow, Docker), analysis and reporting, experimentation, and dashboarding tools (Tableau, Sisense, Metabase) python libraries (Plotly, Seaborn, Matplotlib). ## Work Experience ### Founding Forward Deployed Engineer @ Vori Jan 2025 – Present | San Francisco, California, United States ### Senior Data Engineer @ Shockwave Medical Jan 2025 – Jan 2025 | Santa Clara, California, United States ### Senior Analytics Engineer @ Pfizer Jan 2023 – Jan 2024 | United States As a founding member of the greenfield Commercial Analytics Engineering team at Pfizer, I owned warehousing and migration of source of truth for 20+ Pfizer vaccine and drug portfolio brands including Covid/Flu, Oncology, Women's Health, Gastroenterology, and Rare Diseases by constructing ETL pipelines and generating datasets in a modernization initiative to increase delivery of business insights from Data Science machine learning models. I built source-of-truth datasets for usage in Data Science machine learning forecasting models and executive dashboards, replatformed legacy ETL processes from Dataiku DSS (data science studio) into a modern data stack using Snowflake, Airflow, Docker, and Github Actions, while improving data quality, achieving significant reductions in cost, and delivering dataset enhancements. I validated and created brand-level datasets which consolidated pharmaceutical patient claims, sales and shipments, healthcare provider, and drug distribution data. I shipped internal API tooling and rearchitected ETL pipelines and reduced runtimes of the heaviest workloads 50%+ and reducing snowflake compute from L to S/XS (8x reduction) for Pfizer's Oncology datamart, profiling inefficient processes in Dataiku DSS running on Spark, S3, EC2, and Snowflake, and replatforming onto AIrflow while refactoring queries and rearchitecting transformations achieving 75% reduction for both storage requirements by and cluster compute size, while implementing data quality checks and reducing data source latency from a monthly to weekly latency for executive dashboards. ### Data Engineer @ Meta Jan 2023 – Jan 2023 | Menlo Park, California, United States ### Senior Product Experience Analyst, Data Engineering @ Meta Jan 2022 – Jan 2023 | Menlo Park, California, United States Lead for Business Compromise in Cross-Meta Integrity (XI). Worked in detection, investigation, and recovery of compromised Business Manager clients to secure platform integrity for Meta’s 700MM+ advertisers. Developed automation tooling for investigation and recovery, ML pipelines and feature engineering, credit and payment risk frameworks for all Meta Business Manager clients generating $95B+ annual revenue - including all High Value Businesses ($80B ARR) and Small Spend Advertisers ($15B). Data Engineering and Analytics: Developed and maintained critical data pipelines and frameworks used cross-functionally by Meta Integrity teams to triage, prioritize and recover compromised businesses and for topline metrics and goalin. Owned and maintained primary source-of-truth critical infrastructure datasets and Business/User Graph relationship datasets, deployed ML models and created feature engineering datasets for machine learning detection classifier models. Data Stack/Tooling: Hive, Spark, Presto/Trino, MySQL, Python, Dataswarm (Airflow), Bento (Juypter), Tableau/Unidash/Daiquery (SQL IDE + Dashboarding tools), Scuba (Grafana equivalent), Loggers, Backfill Software Engineering: developed Autopilot, a system to automate point-of-compromise detection and Automated Investigation and Recovery (AIR). These were internal tooling and infrastructure launched as central tooling for Business recovery workflows used quarterly on tens of thousands of businesses, including multi-million dollar High Value Business clients. Performed full-stack work with backend focus: primarily product tooling, logging, and integration with Data Warehouse, and some UI development. Tech Stack/Tooling: Hack (PHP), EntQL (GraphQL wrapper), React/Javascript, chronos Python ETL, Scuba (in-memory realtime analytics DB), Laser (in-memory key-value store). ### Product Experience Analyst @ Meta Jan 2019 – Jan 2022 | Menlo Park, California Data engineering/pipeline/ETL for Voice of Customer dataset, analyses and dashboard creation, and creation of associated upstream data pipelines ingesting customer feedback, Support and Operations correspondences, Help Center feedback, Developer API feedback, and more into a single, comprehensive dataset containing timeseries metadata. Performed Top Issues Mapping text mining/segmentation analysis to size customer pain by product area and NLP topic modeling and clustering analysis on raw text feedback, plus dashboarding and visualization of feedback data. ### Data Science Coach @ Interview Query Jan 2021 – Jan 2023 | United States Coaching for Product Analytics and Data Science job candidates and Mock Interviews for SQL, business sense, and product analytics. ### Senior Professional Services Engineer @ Periscope Data (Sisense for Cloud Data Teams) Jan 2018 – Jan 2019 | San Francisco, California, United States Data Analyst-as-a-Service embedded consulting model for Sisense clients: Uber, Tinder, Side Labs, Seasoned.co, Bungalow, and other partners. Provided end-to-end analytics, data science, and data engineering services: ETL/warehousing, query optimization; cohort analysis, segmentation, regression modeling, dashboarding/tooling/visualization using Periscope Data/Sisense, Python, and SQL. ### Associate Solutions Engineer @ Periscope Data (Sisense for Cloud Data Teams) Jan 2017 – Jan 2018 | San Francisco Bay Area I provided real-time SQL query and data visualization support to hundreds of Sisense customers, primarily data scientists, data analysts, business analysts, and product managers. DB flavors I worked with included Amazon Redshift, Google Bigquery, Snowflake Warehouse, MySQL, PostgreSQL, MSSQL Server, and AuroraDB. I also provided dashboarding and visualization consulting to our customers. ### Technical Service Engineer @ Nevro Jan 2016 – Jan 2017 | Redwood Shores, CA Provided product and sales support to external and internal customers in the U.S., Europe, and Australia for Senza, a Class III implantable neurostimulator. Nevro HF10 therapy provides high-frequency stimulation directly to the spinal cord, providing relief and increasing the quality of life of chronic pain patients Supported external customers (patients, surgeons, physicians, MRI technicians, nurses) and internal departments including Therapy Optimization, District Sales Managers & Regional Sales Directors, Therapy Support, Inventory Control/RMA, Quality Engineering, R&D, Marketing Developed scripting and automation tools using SQL Server, python, javascript, and Selenium IDE to increase efficiency of various Technical Services processes including Patient ID Card management and distribution, complaint database entry Data visualization, reporting/dashboarding, and ad-hoc analytics of patient, device, therapy efficacy, and complaint data to management using Tableau and SQL Server ### Graduate Researcher @ Stanford University Jan 2015 – Jan 2017 | Stanford, CA Khatib Lab, Stanford Robotics / Stanford Artificial Intelligence Lab Presented work in the Haptics talk track at IROS 2017, Vancouver, September 25 HFI-5 is a novel, 5-Degree-of-Freedom Haptic device which has been used in human neuroimaging experiments with the goal of studying the mechanisms of human motor control Presented HFI-5 research at IROS 2017, Haptics Session, Sept. 25, Vancouver, Canada Analyzed robot mechanical and test data, performed temporal noise analysis of experimentally obtained phantom fMRI datasets using MATLAB (SPM16 toolkit, kkutils) and Freesurfer 3D CAD design of parts, assemblies, and test fixtures using Solidworks, fabrication of parts using laser cutting (Epliog and Universal lasers), 3D printing, structural assembly using machine and shop tools Calibration, testing, and debugging of analog motors and power supply electronics, construction of electrical subsystem, MRI operational, noise, and electromagnetic compatibility testing, materials/components analysis and selection Master's research: Fabrication and Design of an fMRI-Compatible, 5 Degree-of-Freedom Haptic Platform ### Biomedical Engineer, Device Security @ Stanford Health Care Jan 2016 – Jan 2016 | Stanford, CA (Contract - SHC Clinical Technology & Biomedical Engineering) Cybersecurity analysis & risk mitigation project involving over 15,000 devices in use across SHC hospitals & clinics; developed medical device security risk and vulnerability framework Clinical and operating room equipment maintenance and patient interaction; cross-functional interaction with clinicians, nurses, IT services and security, and radiology ### Complaint Analyst @ Medtronic Jan 2015 – Jan 2016 | Sunnyvale (Contract) Investigation and management of pill endoscopy and manometry (pH monitoring) device records, ensured compliance per FDA Quality System and Reporting regulations (CFR 803, 820) ### Regulatory Postmarket Surveillance Analyst @ Intuitive Surgical Jan 2014 – Jan 2015 | Sunnyvale, CA (Contract) SQL database analysis of robotic surgery, medical record, legal claim form, and RMA data; used SQL server, excel, and python to compile and perform trending analysis of litigation data submitted to the FDA Generated, submitted, and developed processes for failure analysis reports of Intuitive Surgical's electromechanical surgical instrumentation and supplier endoscopes to hospitals and distributors Created instrument handling and reprocessing best practice guides for hospitals using da Vinci surgery; arranged for report translation into French, German, and other languages; maintained customer relations with hospitals and distributors in Europe and Asia ### Medical Surveillance Specialist @ Johnson & Johnson Jan 2013 – Jan 2014 | Milpitas, CA (Contract) Submitted MDR and MEDDEV forms to FDA, Canadian, and EMEA health authorities and assessed complaints for LifeScan blood glucose meters and Animas insulin pumps ### Intern @ TeleVital Jan 2011 – Jan 2011 | Milpitas, CA Installation and debugging of open medical record software (OpenEMR) in Linux (Ubuntu), graphic design, testing of tele-medical hardware (USB pen camera), general intern duties. Intern from June 2009 - Sept 2009 as well ### Intern @ UC Davis Center for Mind & Brain Jan 2010 – Jan 2010 | UC Davis Center for Mind and Brain Aided in research in Mangun lab. Debugging and programming visual processing reaction time experiments in "Presentation" ECG recording software, data compiling and analysis in Excel, general lab assistant duties. ### Intern @ Stanford University Jan 2007 – Jan 2008 | Stanford, CA Research project in Pringle Lab under Prof. John Pringle conducted in order to understand DNA replication in aquatic creatures. Performed lab duties such as anemone feeding, tank maintenance, anemone transfer, data collection, and assisted in cell counting via scanning electron microscopy. ## Education ### Bachelor of Science (B.S.) in Biomedical/Medical Engineering University of California, Davis ### Master's degree in Biomedical/Medical Engineering San José State University ### Palo Alto High School ## Contact & Social - LinkedIn: https://linkedin.com/in/alok-subbarao-8883bb52 --- Source: https://flows.cv/aloksubbarao JSON Resume: https://flows.cv/aloksubbarao/resume.json Last updated: 2026-04-10