Sunnyvale, California, United States
• Designed and operated distributed search infrastructure for multi-tenant enterprise knowledge systems, enabling scalable retrieval across highly connected datasets.
• Built large-scale indexing pipelines and query-serving APIs using Python and FastAPI, powering OpenSearch-based retrieval systems that handle 200K+ daily queries with 99.9% uptime.
• Reduced search latency by 35% through query optimization, asynchronous I/O, connection pooling, and caching strategies.
• Implemented a hybrid lexical–semantic retrieval architecture, integrating embedding-based semantic search with traditional inverted-index ranking.
• Instrumented observability and performance monitoring for search services, tracking latency, throughput, and reliability metrics across distributed retrieval pipelines.