2025 — Now

Palo Alto, CA

2022 — 2025AmazonSoftware Engineer II

2022 — 2025

Palo Alto, California, United States

Develops features on a remote, scalable, high-throughput ML inferencing service to rank relevant ads to show on Amazon’s retail website, leveraging Triton for GPU inferencing. This enables using large-scale DL models (1B+ params) for ad ranking within a 40ms latency budget at 200K+ TPS to support a $1B advertisement ecosystem.

Develops features for a microservice deployment platform used to host real-time services in Amazon Ads. This leverages ECS/EC2 + Envoy proxy to build a network of 15k+ hosts with 1ms network latency with out-of-the-box client-side load-balancing, traffic shaping, and observability (request tracing).

2020 — 2022AmazonSoftware Engineer

2020 — 2022

Palo Alto, California, United States

On the Machine Learning Delivery team within the Sponsored Products organization. Works to develop our scalable remote ML inferencing solution for the advertisement delivery pipeline. Helps maintain an inferencing server that can load and serve modeling results to our ad server.

2020 — 2020Juni LearningComputer Science Tutor

2020 — 2020

Berkeley, California, United States

Taught students aged 8 to 18 in the principles of computer science and programming through a series of focused curriculums and hands-on programming projects.

Developed a learning strategy for each student based on their age, prior experience, and inclination towards different computer science topics.

Focused on a wide variety of subjects in computer science, including learning standard industry programming languages such as Python, Scratch, Java, and HTML/CSS, but also the basics of web development, data analysis, and computer graphics.

Experienced in a range of education levels, from basic graphics in Scratch up to test-taking strategies for the AP Computer Science exam.