Emeryville, California, United States
Worked on the music platform team, responsible for handling data uploads from major record labels and streaming services such as Sony Music, UMG, Warner, and Spotify. Developed backend services and data pipelines that ingest and process music metadata, supporting both internal and external products.
• Built backend services in Python (FastAPI) to classify music releases, processing thousands to millions of albums per day
• Improved a legacy album classification pipeline by switching to incremental processing, reducing processing time by ~95%
• Maintained and extended backend services by adding features, running data backfills, and fixing critical issues to ensure accurate and consistent music metadata
Tech Stack: Python, Golang, SQL, AWS, Apache Airflow, Docker, Kubernetes, Helm, MySQL, Oracle, HDFS, Elasticsearch, Grafana, Jenkins, CI/CD