# Songlu Li > Data Engineer/ Data Analyst in Biotech/ Pharmaceutical/ Healthcare Location: Boston, Massachusetts, United States Profile: https://flows.cv/songlu Dynamic Data Engineer with extensive experience in designing and optimizing data pipelines, data modeling, and data warehouse development. Demonstrated expertise in Python, SQL, and various data engineering tools. Adept at working in fast-paced environments, keen on supporting data-driven decisions through robust data solutions. ## Work Experience ### Software Engineer II, Data Engineering @ Intellia Therapeutics, Inc. Jan 2023 – Jan 2024 | Cambridge, Massachusetts, United States - Target Off-Target Screening Data Analysis: Participated in enhancing the automated generation of data packages for genetic editing analysis. Notable contributions include: - Automated the compilation of summary statistics for off-target genome editing data, optimizing analysis efficiency and accuracy. - Developed a process for streamlined assessment of site-level information, accelerating the validation of off-target activity. - Assisted in compiling rhAmpSeq data into comprehensive PowerPoint packages, setting a standardized format for data deliverables. - Engaged with cross-functional teams during user testing sessions, incorporating stakeholder feedback to refine data analysis tools and workflows. ### Data Enigneer @ Roivant Sciences Jan 2021 – Jan 2023 | United States - Produced 17 pipelines to ingest R&D experimental data /Real World Evidence data, which enabled Roivant to scale the number of ongoing projects without a large increase in the headcount and enabled reducing the number of licenses of a commercial drug discovery database (CDD Vault) from 101 to 23. - Created a containerized end-user data ingestion tool, automating the ingestion of computational data directly from an on premise high performance computing cluster. - Created an end-user tool with extensive guardrails to enable new compound registration from generative computational workflows via API calls to a commercial compound registry. New compound registration increased from hundreds per week to several thousand. - Built an automated cloud-based workflow to ingest experimental assay data to the data warehouse. Solution included automated triggering from new file upload to Google Cloud Storage and Cloud Functions for data processing, curve fitting, and data validation. - Parsed and managed terabytes of PII and PHI government collected medical records from Centers for Medicare & Medicaid Services (CMS), with service using internally developed dag-workflows pipeline tool and Google BigQuery. - Designed data pipelines for loading foxed width delimited files into Google Bigquery. Processed terabytes of PII/PHI medical records from Centers for Medicare & Medicaid Services (CMS). Executed using an in-house dag-workflows framework and optimized storage for Google BigQuery. Ingested and cleaned the GOSTAR structure activity relationship database to Google BigQuery. - Worked out complicated compound resources such as extracting pooled plate components from Titian Mosaic sample management software through REST API. - Benchmarked the utility of open-source synthetic feasibility tools in the drug discovery projects. ### Data Analyst @ Yale University Jan 2020 – Jan 2021 | New Haven, Connecticut, United States - Enhanced post-traumatic epilepsy prediction by analyzing neurologic ICU data, applying generalized estimating equations (GEE) and logistic regression to analyze EEG discharge burden, - Applied machine learning techniques such as boosting and random forest for improved predictor accuracy. ### Data Analyst @ Zemble Jan 2019 – Jan 2019 | United States - Generated visual analytics for healthcare plan comparison and conducted customer behavior analysis, highlighting key insights on plan value and customer engagement patterns. ## Education ### Master of Science - MS in Biostatistics University of Connecticut ### Master of Science in Chemical Genomics Peking University ### Bachelor of Science - BS in Pharmaceutics China Pharmaceutical University ## Contact & Social - LinkedIn: https://linkedin.com/in/lisonglu --- Source: https://flows.cv/songlu JSON Resume: https://flows.cv/songlu/resume.json Last updated: 2026-03-31