Accomplished Staff Software Engineer and Tech Lead at Waymo, specializing in the development of state-of-the-art generative AI for autonomous vehicle simulation.
2024 — Now
Mountain View, California, United States
Tech leading our team to develop state-of-the-art system to generate and simulate realistic environment(Scene, Camera, Lidar data) for Waymo driver eval and validation using latest techs like video diffusion(Veo3.1), World Models(Genie3), 3DGS, NeRF, LLM Agent.
Large-Scale 3DGS Sensor Simulation: Founded and scaled the 3D Gaussian Splatting (3DGS) sensor simulation at Waymo from a small project to over one million reconstructions. This initiative reconstructs realistic 3D scenarios from sensor logs for safety evaluations and has grown from a team of 3 to a 20+ person effort, shaping Waymo's next-gen ML-based simulation.
Sensor Diffusion: Contributed to a research project using video diffusion models to generate synthetic sensor data for testing autonomous vehicle behaviors, with work published in IROS 2025 (Oral). This evolved to use the world model, Genie3, for multi-modal (Camera, Lidar) simulation of rare events.
SceneDiffuser and SceneDiffuser++: Founded and developed the SceneDiffuser model, the core of SceneAI, for large-scale, controlled scenario creation and traffic generation, with research published in NeurIPS 2024. SceneDiffuser++ expanded this for city-scale traffic generation, with work published in CVPR 2025.
SceneAI: Directed the development of the generative AI engine for the team's data synthesis platform, delivering tools to generate traffic, densify rare events, and create scenarios from natural language.
Agentic Simulation Realism Triage: Proposed and developed a Gemini-based agent to triage the realism of simulation results, enhancing its tool-use and reasoning capabilities.
Agentic Scene Generation: Developed a Gemini-based agent using a diffusion model for end-to-end synthetic scenario generation from natural language and images. This project won a company-wide hackathon and was launched in the data synthesis platform.
AutoDense: Developed a diffusion model to generate rare events from log, supporting multiple launches.
2022 — 2024
Lead our team’s effort on three generative AI(AIGC) products with four engineers, we work with top researchers to train deep learning diffusion model(similar to DALL·E 2, Stable Diffusion) and build system around it to scaly generate realistic synthetic data for running simulation of Waymo and evaluate Waymo driver behaviors.
Responsible for model design & implementation & training, model integration with production simulation system, cross team collaboration, production usage landing and onboard engineers engagement.
AITrafficGen: Landed our first generative AI product: AITrafficGen, which support generate realistic traffic in large scale(>100k scenarios) to discover long-tail events. Model produces 16 agents, with 12s trajectory of each, similar to diffusion based video generation. The synthetic set created with it already used in prod launches.
AutoDense: Proposed and building our second generative AI product for automatic long tail events densification, support constrainted diffusion to generate similar rare events given seed long tail events to densify evaluation signal. This is similar to image guided diffusion. Backbone of it is based on Perceiver IO, and used transformer as encoder & denoiser. We build the model as large multi-task foundational model.
LLM Guided SceneGen: Inspired by DALL·E 3, proposed third generative AI product use multi-modal LLM to guide our diffusion model generate synthetic scenario end-to-end given natural language description and image to boost the synthetic data generation at Waymo. Initial protoptype shows very promising result.
2021 — 2022
Work in Google Assistant Evaluation Eng Team, Focused on building our next-gen Assistant Eval Infra: Assistant Hermetic Eval which provides high eval fidelity, better data
privacy, better scalability.
Tech-Led three engineers to build two critical subsystems: Output scrubbing and on-device hermetic eval
Scoped the open-ended problem, designed the whole project, lead three engineers implemented the project.
– Drove the collaboration with Google Assistant Infra team, NLG(Natural Language Generation) Team, Feature teams to
implement the project.
– Tackled the key problem by using the semantic based annotation in the NLG request.
– Presented the project in org’s all-hands.
2019 — 2021
Mountain View
In Google Assistant Evaluation Eng Team, I Proposed & designed & implemented Assistant Eval Fidelity Toolchain, which is a series of tools to help improve Assistant Eval
Fidelity including:
– Fidelity Dashboard: Dashboard show eval fidelity for each surface feature. Built using Google SQL, PLX.
– Fidelity Search: Automatically search for the data that can increase the eval fidelity by construct field tree, eliminate nodes,
issue data to Assistant stack.
– Fidelity Bug Manager: File bugs to developers for fidelity findings from fidelity search automatically.
– Planned & conducted Fidelity Fixit event with 7 feature teams 1 surface team, fixed 38 fidelity issues.
– Authored 31,166 lines of codes for the toolchain and got 3 spot bonuses, 4 peer bonuses for it.
Designed & Implemented deep learning based Side-by-side eval noise classifier
– Designed & implemented noise data collection pipeline in our eval tool to collect noise, join eval data and extract noise
features.
– Designed & implemented the noise classifier training pipeline using TensorFlow, TFX(Tensorflow extended) to train the DNN
model and the random forest model for noise classification.
– Designed & implemented the noise confidence metric in in eval system for users to filter out the noise automatically from their
eval report, and also provides noise feedback for training.
– Presented the classifier in Org’s all-hands
– Authored 18,539 lines of codes for the project
2017 — 2019
Work in Google Assistant Infra Team, I migrated the Google Assistant Conversation State API, Built tool to audit Assistant backend failure and improving stability by injecting backend errors.
Education
2020 — 2022
Stanford University
Artificial Intelligence
2020 — 2022
2015 — 2016
University of Wisconsin-Madison
Master's degree
2015 — 2016
2012 — 2013
University of California, Berkeley
Exchange
2012 — 2013
2010 — 2014
Tianjin University of Technology
Bachelor's degree
2010 — 2014