# Sarah Dias Barreto > AI/ML Engineer @ Aynak | Data Scientist | Data Analyst | MS in Data Science, Indiana University, Bloomington Location: Greater Boston, United States Profile: https://flows.cv/sarahdiasbarreto I’m a Data Scientist and AI Engineer passionate about turning complex, unstructured data into actionable insights that drive impactful decisions. With extensive startup experience across AI, combined with academic research and nonprofit analytics, I specialize in data extraction, statistical modeling, and building scalable predictive and AI solutions. Currently, I work as an AI Engineer at Aynak, focusing on advanced audio signal processing and intelligent systems. At Project 990, I engineered scalable, automated data ingestion and transformation pipelines, optimized and deployed large language models (LLMs) leveraging high-performance computing clusters, performed rigorous financial data extraction and causal inference using advanced statistical methods and developed interactive and high-fidelity Tableau visualizations including ACFR reporting, architect distributed nationwide grant allocation frameworks, I’m driven by a passion for using technology to build meaningful solutions that create real-world impact. I’m actively seeking full-time roles in Data Science, Engineering, or AI within tech, finance, and mission-driven organizations. Let’s connect to explore collaboration opportunities where data and AI create meaningful impact. When I’m not decoding data, I’m probably uncovering hidden gems to explore, curiosity fuels both my work and my adventures. ## Work Experience ### Founding AI Engineer @ Aynak, Inc. Jan 2025 – Present | San Diego, CA ### Data Engineer @ Project 990 Jan 2025 – Jan 2025 | Washington, District of Columbia, United States • Automated Alteryx workflows to filter and process 500K+ congregation records, reducing data preparation time by 38% and boosting data quality scores by 19%. • Led causal discovery with placebo testing, filtering 50% of models to improve effect estimation on education outcomes. ### Data Engineer @ Project 990 Jan 2024 – Jan 2025 | United States • Deployed zero-shot RoBERTa classifiers (NLP) for 500,000+ mission statements; accelerated inference time by 40% via DeepSpeed optimization. • Engineered scalable ETL pipelines, increasing data throughput by 35% and reducing integration errors by 23%. • Developed Tableau dashboards to distill complex data into strategic insights, accelerating decision cycles by 36%. ### Research Scientist - Advanced Data Analytics @ Indiana University - Kelley School of Business Jan 2025 – Jan 2025 • Accelerated multiverse analysis workflows by 16×, enabling rapid, large-scale sensitivity testing. • Directed longitudinal survival analysis on 1,200+ entrepreneurs using multiverse modeling (R), uncovering key health and business risk factors. • Analyzed 30+ high-dimensional behavioral and health attributes to identify drivers of entrepreneurial mortality and resilience. ### Data Science Consultant @ Public Budgeting and Finance Jan 2025 – Jan 2025 ### Data Science Intern @ Public Budgeting and Finance Jan 2024 – Jan 2025 | United States • Created an automated email service and a website to advertise the journal, leading to a 11% increase in research engagement. • Analyzed performance metrics post-deployment of both frontend and backend on a single Heroku dyno, which reduced hosting costs by 30% and improved API call response times by 22%, enhancing user experience. • Curated a repository of over 600 researchers, streamlining collaboration and boosting project matching speed by 25%. • Leveraged pretrained BERT models and cosine similarity to identify matching articles, improving matching accuracy by 85% and reducing researchers’ article search time by 42%. ### Data Science Research Assistant @ O'Neill School of Public and Environmental Affairs Jan 2024 – Jan 2025 | United States Extracted and synthesized financial insights from unstructured Annual Comprehensive Financial Reports (ACFRs) using NLP and data parsing techniques. ### Data analyst Intern @ Indiana University - Kelley School of Business Jan 2024 – Jan 2024 | Bloomington, Indiana, United States ### Software Development Intern @ Simplit Software Solutions Jan 2022 – Jan 2022 | Goa, India ### SWE Intern @ Persistent Systems Jan 2021 – Jan 2021 ## Education ### Master of Science - MS in Data Science Indiana University Bloomington ### Bachelor's degree in Computer Science and Engineering National Institute of Technology, Goa ## Contact & Social - LinkedIn: https://linkedin.com/in/sarah-dias-barreto - Portfolio: https://sarah-dias-barreto.netlify.app/ - GitHub: https://github.com/sarah9db --- Source: https://flows.cv/sarahdiasbarreto JSON Resume: https://flows.cv/sarahdiasbarreto/resume.json Last updated: 2026-04-17