# Yang Song

> AI Practitioner | Multimodal Foundation Models

Location: San Jose, California, United States
Profile: https://flows.cv/yangsong1

## Work Experience

### Staff Software Engineer / Ex-Engineering Manager @ Waymo

Jan 2018 – Present

Building multimodal foundation models for Waymo.

Google Scholar: https://scholar.google.com/citations?user=Y6L6ZYsAAAAJ

Invited talk at the ICCV 2019 Workshop on Autonomous Driving (http://wad.ai/).

Another example of my work: https://blog.waymo.com/2020/04/using-automated-data-augmentation-to.html

### Staff Software Engineer @ Google

Jan 2008 – Jan 2018

Tech Lead Manager in Mobile Vision. I led the first launch of the natural world vertical in Google Lens and was project lead for Production Recognition. I was an organizer of the Fine-Grained Visual Categorization competitions and workshops (FGVC4 and FGVC5).

Tech Lead for the image content feature project. I applied computer vision and machine learning techniques to Google Image Search.

Tech Lead for video classification projects. I developed state-of-the-art computer vision and machine learning methods for YouTube and Google Image Search.

### Member Of Technical Staff @ Kosmix

Jan 2007 – Jan 2008

### Research Scientist @ Fujifilm

Jan 2003 – Jan 2007

### Postdoc Scholar @ Caltech

Jan 2003 – Jan 2003

### Graduate Research Assistant @ Caltech Vision Lab

Jan 1997 – Jan 2002

## Education

### Ph.D. in Electrical Engineering

Caltech

### Tsinghua University

## Contact & Social

- LinkedIn: https://linkedin.com/in/yang-song-bba4434
- Portfolio: http://www.vision.caltech.edu/yangs/
- Portfolio: https://ai.google/research/people/author38270/

---

Source: https://flows.cv/yangsong1
JSON Resume: https://flows.cv/yangsong1/resume.json
Last updated: 2026-04-12