San Francisco, California, United States
Maintained and helped scale compute and data platform used by a growing research organization of over 100 computational scientists and engineers for running ML workflows and experiments
Led development of an artifact service that enabled researchers to register molecular artifacts with schematized custom metadata and discover and reference them for their experiments; >40k artifacts registered
Developed pipelines for ingesting clinical and lab data into new data lakehouse
Executed and helped strategize organization-wide migration of a lab management system, migrating tens of GBs of structured data across thousands of fields; coordinated cross-functionally with PMs, analysts, and scientists to minimize disruption
2017 — 2020
San Francisco Bay Area
Developed and maintained de-identification pipeline used to deliver cancer patient datasets to a large pharma company; implemented using PySpark, parquet files on S3 for persistence, and AWS EMR for deployment
Headed >1000x scaling of CDC (change data capture) service processing 30 million records/day during data migrations for large health system customers; implemented observability via New Relic and performance tuning on RabbitMQ
Refactored an ETL pipeline, which ingests manually abstracted data and merges from disparate data sources, in order to align with major architectural update; moved persistence from S3 to AWS Aurora, automated test runs on CircleCI
Designed and developed services to transition Django application monolith to microservices architecture deployed on a Kubernetes cluster; implemented form app backends with Flask/DynamoDB/PostgreSQL and built an interface to Auth0
2016 — 2017
Palo Alto, California
Developed the clinical analytic reports feature, which aggregated clinical and molecular data of thousands of patients into insights for clinicians; wrote complex queries leveraging PostgreSQL jsonb and built framework to maintain them
Implemented content migration strategy for managing versioning and updating of clinical trial and RxNorm therapy content across single-tenant customer databases
Maintained the Syapse Application, which included writing/optimizing SQL queries, implementing REST API endpoints in a Django application, writing migrations, and writing SPARQL queries against Blazegraph (AWS Neptune)
2014 — 2015
Toronto, Canada Area
Elicited requirements from stakeholders, developed medium fidelity prototypes, conducted user acceptance testing and wrote software requirements specifications for web application product
Managed team projects throughout the software development lifecycle from planning to deployment
Spearheaded implementation of new documentation and communications platform (Confluence) to resolve team collaboration issues
Western Province, Kenya
Led needs assessments and infrastructure improvement at 4 government health facilities
Planned and supervised campaign to treat and prevent jiggers within 3 communities spanning 51 households in Sabatia District
Conducted research on pharmaceutical drug stock-outs in rural Kenya identifying inefficiencies within the supply chain
Education
University of Toronto
Bachelor of Applied Science (B.A.Sc.)
Hack Reactor