# Saksham Arora

> New Ventures @ Roivant | Dartmouth

Location: New York, New York, United States

Profile: https://flows.cv/sakshamarora

Passionate Software Engineer with experience in machine learning and quantitative modeling spanning research and industry, specializing in natural language processing, bioinformatics and mobile security. You can reach me at: saksham.arora.23@dartmouth.edu.

## Work Experience

### Founding Engineer @ Savant Bio

Jan 2025 – Present | New York, United States

Transforming medical data into actionable insights.

### Software Engineer — AI Product (Roivant Health) @ Roivant Sciences

Jan 2025 – Present | New York, United States

Building 0 to 1 @ Savant Bio (Roivant's latest health technology incubation)

### Software Engineer — ML/Product, Real‑World Evidence (RWE) @ Roivant Sciences

Jan 2024 – Jan 2025 | New York, United States

Worked on accelerating patient pre-screening for clinical trial recruitment

### Software Engineer — Voice AI (Computational Research, Sumitomo Pharma America) @ Roivant Sciences

Jan 2024 – Jan 2024 | New York, United States

Worked under Dr. Carson Tao on the Computational Research Team at Sumitomo-Pharma America (contracted)

- Architected and implemented an end-to-end automatic speech-to-text system (from data pipelines to model re-training) that transcribes and de-hallucinates doctor-patient conversations for the extraction of meaningful audio and linguistic signals.
- Prompt-engineered and deployed LLMs such as GPT-4, GPT-3.5, Mistral-7B, and Llama-3 for speaker diarization with zero-shot, few-shot and chain-of-thought learning approaches.
- Fine-tuned Mistral-7B-Instruct and Llama-3 models using PEFT-based approaches, including LoRA, QLoRA, DoRA, and QDoRA, for the speaker diarization pipeline, achieving state-of-the-art results.
- Evaluated the internal SMPA pipeline against open-source and proprietary ASR models from Google Cloud, Amazon, and Microsoft Azure, achieving SOTA results for multilingual recordings.

### Software Engineer — Data Engineering (Sumitomo Pharma America) @ Roivant Sciences

Jan 2023 – Jan 2024 | New York, United States

- Orchestrated distributed ETL pipelines for a production Redshift data warehouse that streamlines sales team tracking and analysis for improved customer targeting.
- Led the design and implementation of an automated AWS Lambda-based ETL pipeline, extracting and integrating data incrementally from the RAVE Medidata API into AWS Aurora.
- Created 20+ data jobs with SCD2 and history tables using SQL and Matillion for Redshift.
- Collaborated with data engineers and Urovant consultants to normalize pharmaceutical data.

### Undergraduate Researcher @ The Dartmouth Institute for Health Policy & Clinical Practice

Jan 2022 – Jan 2023 | Hanover, New Hampshire, United States

- Fine-tuned BioBERT, ClinicalBERT, and ClinicalLongformer language representation models for automated patient chart review to identify care preference documentation.
- Sped up data wrangling and analysis by 50x by building custom software packages using PySpark for ETL between PostgreSQL servers and ML models.
- Developed a data extraction pipeline that transforms raw EHR data contained in the MIMIC-IV database into dataframes for easy integration into existing ML pipelines, using the dask, pandas, and numpy libraries.
- Received $5,000 USD in research funding, awarded High Honors in Computer Science, submission accepted at IEEE ICSC/AIMHC '24.

### Computer Science Teaching Assistant @ Dartmouth College

Jan 2021 – Jan 2022 | Hanover, New Hampshire, United States

### Computer Science & Mathematics Tutor @ Dartmouth College

Jan 2019 – Jan 2022 | Tutor Clearinghouse, Hanover, New Hampshire

### Research Assistant for Prof V.S. Subrahmanian @ Dartmouth College

Jan 2020 – Jan 2021 | Hanover, New Hampshire, United States

Worked under the supervision of Professor V.S. Subrahmanian at The Institute for Security, Technology, and Society (ISTS) on Android malware detection, machine learning, and smartphone application security.

### Intern @ Netaji Subhas University of Technology (NSUT), Delhi

Jan 2018 – Jan 2018 | Dwarka Area, India

Worked under Dr. Dhananjay V. Gadre, ECE Department, on embedded systems and their applications, using platforms such as Arduino and MSP430.

## Education

### Mathematics in Computer Science

Dartmouth College

Jan 2019 – Jan 2023

### Delhi Public School - R. K. Puram

Jan 2004 – Jan 2018

## Contact & Social

- LinkedIn: https://linkedin.com/in/saksham324
- Website: https://sakshamarora.me

---

Source: https://flows.cv/sakshamarora

JSON Resume: https://flows.cv/sakshamarora/resume.json

Last updated: 2026-04-01