@Amazon Prime - AutoTargeting
● Designed and implemented offline data processing pipeline using PySpark with ability to process 200MM+ records, and generated segment level model performance results used inside WBR report.
● Built automation pipeline for data loading and reporting jobs, set up Tableau WBR report to review clients' model performance.
● Built rule-based automating selection tool to reset WW 200+ marketers' experiment baseline so that all experiments have incremental performance.
● Implemented offline simulation tool and reduced 80% time to evaluate performance of different feature generator methods (Raw, Prime2Vec, XGBoost) combined with several scoring algorithms (BLIP, BLR).