Build data pipelines and perform data analysis for a high-throughput affinity selection drug discovery platform
• Maintain relational data warehouse in BigQuery (Google Cloud Platform) using ETL processes
• Build and parallelize data ingestion pipeline, reducing compute time by 80%
• Improve data cleaning and transformation techniques to standardize data formats and enrich datasets with additional metrics
• Build machine learning cheminformatics tool, enabling chemists to analyze similarity of target binding mechanisms
• Analyze platform productivity by integrating current and historical data with SQL queries, performing statistical analyses in Python, and presenting results in Spotfire visualization dashboards