# Raghava Srinivasan > Data engineer Location: San Francisco, California, United States Profile: https://flows.cv/raghava Data Engineer with 6+ years of experience architecting and optimizing scalable data infrastructure, distributed systems, and algorithmic optimizations. Expertise in designing scalable ETL workflows, stream processing (Flink, Kafka), and cost-efficient cloud-native solutions (AWS, Kubernetes). • Built data pipelines processing 600K+ events/sec and 25TB/day, ensuring 99.99% availability • Reduced operational costs by 50% through optimized lookup solutions and Union-Find-based deduplication • Deployed three MVPs in six months, enabling rapid product-market fit iterations ## Work Experience ### Software Engineer, Data @ MileIQ Jan 2024 – Present | San Francisco, California, United States - Developing an A/B/n testing framework integrated with LLM (Code Llama), vectorDB, and graph DB to map RDBMS relationships, enabling intelligent Snowflake queries for enhanced data-driven decision-making. - Deployed a scalable real-time data pipeline for drive classification, leveraging advanced spatial-temporal feature engineering to enhance ML model accuracy by 10%. ### Lead data engineer @ Lentra Jan 2022 – Jan 2024 - Led a 30-member engineering team in developing a real-time profiling system for BFSI, processing 600K events/sec and 25TB/day, ensuring 99.99% availability. - Implemented a Union-Find-based graph deduplication system, clustering similar data entities in O(α(n)) time complexity, achieving 50ms lookup speeds and reducing storage/operational costs by 50%. - Designed and deployed facial recognition and fuzzy matching pipelines, increasing identity verification accuracy by 15%. - Pivoted from AWS to Kubernetes in 3 months, cutting setup time and standardizing deployments across financial institutions to meet evolving business needs. - Collaborated with PMs to prioritize and deliver features that drove a 20% increase in MAU growth. ### Senior Data Engineer @ Presidio Jan 2021 – Jan 2022 | Chennai, Tamil Nadu, India - Built and optimized ETL pipelines handling 3GB/day of data, improving processing efficiency by 30% using AWS Glue, Kinesis, and Step Functions. - Implemented incremental data ingestion pipelines, reducing compute costs by 30% via delta calculation techniques. ### Data Engineer @ Presidio Jan 2020 – Jan 2021 - Developed Kimball-modeled data warehouses in Snowflake, improving analytical query performance by 5%. - Automated cloud infrastructure provisioning with CloudFormation, accelerating deployment cycles by 1.5x. ### Software Engineer @ Presidio Jan 2019 – Jan 2020 Full-stack development and Linux shell scripting - Worked on full-stack development for a marketplace application using Angular 8 and NodeJS. - Worked on full-stack development for an accelerator product using Angular 8 and Python. - Worked on Linux shell scripting to check best-practice implementations in AWS. ### Associate - Projects @ Cognizant Jan 2018 – Jan 2019 - Built a Python-based data pipeline for clustering and fuzzy matching, improving text deduplication accuracy. - Integrated sentiment analysis, enhancing the depth and precision of data insights. - Developed an NLP-powered name generation script, streamlining data analysis efficiency. ### Intern @ Cognizant Jan 2018 – Jan 2018 - Built a supervised KNN multi-label classifier in Python for .xlsx data - Leveraged WEKA for data mining, feature extraction, and dataset cleaning. - Developed a J48 decision tree, enabling efficient large-scale data classification. - Designed a path-finding algorithm, reducing computation time by 50% and boosting efficiency by 30%. ## Education ### Master of Science - MS in Data Science University of San Francisco ### Bachelor of Engineering (B.E.) in electronics and communication engineering Velammal Institute Of Technology ### High School in Computer Science The Velammal International School ### Jawahar Vidhyalaya Senior Secondary School ## Contact & Social - LinkedIn: https://linkedin.com/in/raghava-srinivasan --- Source: https://flows.cv/raghava JSON Resume: https://flows.cv/raghava/resume.json Last updated: 2026-04-05