# Yichen Zhang > Software Engineer · AI/ML Engineer | AI @ CMU School of Computer Science | LLM, RAG, Full-Stack | Actively Seeking New Opportunities! Location: San Francisco Bay Area, United States Profile: https://flows.cv/yichenzhang AI/ML Software Engineer with a B.S. in Artificial Intelligence from Carnegie Mellon University's School of Computer Science. I specialize in building and deploying LLM-powered products — at iHealth Labs, I built AI features that generated 110,000+ auto-drafted responses and reduced care team response time by ~54%. Experienced in Python, FastAPI, RAG, LLM APIs, AWS, and full-stack development. Highly proficient in AI-assisted coding, enabling me to build and ship products at exceptional speed. Self-driven with high agency and a strong ability to learn fast, adapt, and deliver independently. Passionate about shipping AI products that deliver real impact at scale. Open to Software Engineer, AI/ML Engineer, and Research Engineer roles — feel free to reach out! ## Work Experience ### Software Engineer - AI & Research @ iHealth Labs Jan 2024 – Present | Sunnyvale, California, United States • Built and deployed an AI auto draft feature in the internal portal to draft a response for new patient messages. 110,000+ drafts generated since deployment. Care team sent 40%+ AI drafts to patients without editing. Reduced the care team's time to respond to patient messages by ~54%, time per message median down from 1m22s to 38s. (Python FastAPI + OpenAI API + MongoDB) • Built and deployed a patient summary feature in the internal portal that uses LLM to generate a two-sentence summary for any given patient, enabling the care team to get a patient overview within 10s. Implemented a caching mechanism using MySQL that balances token consumption, latency, and information recency. (Python FastAPI + OpenAI API + MySQL + MongoDB) • Built and deployed an RAG chatbot in the internal portal to answer internal questions about any information in our enormous documentation with source citations. Used AWS Bedrock to chunk, embed, index, and retrieve document chunks. Answered 1,700+ questions since deployment. (Python FastAPI + OpenAI API + AWS Bedrock) • Led a team of senior colleagues in the internal Hackathon. Designed and built 6 AI characters to remind and encourage patient logging in a Web/Mobile interface. Led the presentation and earned the highest score for presentation. (Python FastAPI + OpenAI API + HTML + JavaScript + CSS) • Sampled 61,000+ patient chat messages and used LLM to classify each with one of 32 predefined patient intents. Analyzed intent distributions and provided insights into our chat automation effort. (Python + pandas + numpy) ### Software Engineer @ IvyMind Consulting LLC Jan 2025 – Jan 2025 Designed and built a semantic and keyword-based essay search system for 1,200+ student essays with a query latency under 1 ms. Embedded essays and prompts using the nomic-embed-text-v1.5 model and indexed the embeddings using FAISS. (Python + Text Embedding + FAISS + AWS EC2 + Docker + MongoDB) ### Teaching Assistant, 15-451 Algorithm Design and Analysis @ Carnegie Mellon University School of Computer Science Jan 2022 – Jan 2022 | Pittsburgh, PA Held weekly recitations and office hours, graded student homework and exams. ### Software Engineer Intern @ Roborock Jan 2018 – Jan 2018 | Beijing, China Tested the SLAM algorithm in the route-planning software for autonomous sweeping robots. ## Education ### B.S. in Artificial Intelligence in School of Computer Science Carnegie Mellon University ### High School Diploma PRINCETON INTERNATIONAL SCHOOL OF MATHEMATICS AND SCIENCE, INC. ## Contact & Social - LinkedIn: https://linkedin.com/in/yichen-zhang-a06759290 --- Source: https://flows.cv/yichenzhang JSON Resume: https://flows.cv/yichenzhang/resume.json Last updated: 2026-04-05