Manhattan, New York, United States
Coordinated the rollout of production-level automation for two major publishing companies, then maintained and enhanced it.
Fine-tuned a RoBERTa-base transformer for automated classification of company data. Used PyTorch to develop an end-to-end machine learning pipeline that preprocesses and labels data, then trains a supervised large language model.
Worked extensively with the Snowflake database to perform ETL and build automated data pipelines. Wrote data transformations using dbt. Loaded source data using DML and DDL.
Added production-level enhancements to Django-based web applications. Worked extensively with asynchronous Celery jobs to support services used by internal clients.