# Yeshwanth Govindu

> Software Engineer | AI/ML Cloud Platform | Distributed Systems | Kubernetes, AWS, GCP | Kafka, Spark | LLM Inference | vLLM, Triton

Profile: https://flows.cv/yeshwanth-govindu

Software Engineer with 5+ years of experience specializing in AI/ML cloud platforms, distributed systems, and cloud-native architectures across AWS and GCP. Strong expertise in Kubernetes (EKS/GKE), microservices, and infrastructure-as-code with Terraform.
Proven track record of building high-throughput data pipelines using Kafka and Spark, processing 500M+ messages/day, and reducing event propagation latency by 70%. Experienced in designing and deploying LLM inference pipelines using vLLM and Triton Inference Server on Kubernetes, improving system scalability and resource utilization.
Core competencies include:
- Cloud & DevOps: AWS (EKS, ECS, Lambda, CloudWatch), GCP (GKE, BigQuery, Stackdriver), Kubernetes, Docker, Terraform, CI/CD (Jenkins, GitLab)
- Data Engineering: Apache Kafka, Spark (Batch & Streaming), RabbitMQ, ETL Pipelines, S3/ADLS, Snowflake, BigQuery
- AI/ML: PyTorch, TensorFlow, LLM Fine-tuning (LoRA, SFT), vLLM, Triton Inference Server, NLP (BERT, GPT, LLaMA)
- Databases: MySQL, PostgreSQL, Snowflake, BigQuery, Redis
- Programming: Python, Java, C++, SQL
I am passionate about building scalable, high-performance systems that handle hundreds of millions of events daily. Always open to connecting with like-minded engineers and exploring opportunities in AI/ML infrastructure, data engineering, and distributed systems.
Location: Dallas, TX | Open to Remote and On-site opportunities

## Work Experience
### Software Engineer, AI/ML & Cloud Platform @ Prowesys
Jan 2024
• Led cloud-native deployment of distributed systems on AWS and GCP, leveraging Kubernetes (EKS/GKE) to build scalable, highly available services across multi-region environments using Terraform.  
•  Built and optimized event-driven data pipelines using Kafka and RabbitMQ, processing 500M+ messages/day with S3/ADLS backed storage, reducing event propagation latency by 70%.  
• Developed observability and monitoring solutions using Grafana, ELK, CloudWatch, and GCP Profiler, improving system visibility and reducing MTTR by 35% across production services.  
• Built and deployed LLM inference pipelines using vLLM and Triton Inference Server, improving system scalability and resource utilization across Kubernetes-based deployments.   
• Developed end-to-end LoRA and SFT fine-tuning pipelines for domain-specific models, improving compliance text classification accuracy using BigQuery and Snowflake feature engineering workflows.  
• Designed and deployed FastAPI-based ML services serving 1M+ daily requests, leveraging async processing and Redis caching to reduce p95 latency by 60%.

### Student Assistant @ University of North Texas
Jan 2023 – Jan 2024

### Software Engineer, Distributed AI Systems @ Tata Elxsi
Jan 2020 – Jan 2023 | Bengaluru
• Designed and deployed identity resolution microservices in Python/C++, unifying 50M+ Entra ID and HR records with versioned audit trails, self-healing workflows, and high availability.  
• Refactored a monolithic system into scalable microservices architecture, implementing contract testing and phased rollouts to improve system throughput by 45% without regressions.  
• Built high-throughput data pipelines using Kafka and Spark Streaming, processing 200M+ events/day with secure archival to S3/Glacier and compliance with FINRA/SEC retention policies.  
• Built and deployed LLM inference services using Triton Inference Server and vLLM, enabling scalable, production-grade model serving for enterprise workloads.  
• Optimized large language model performance (GPT, BERT, LLaMA) through quantization and efficient inference strategies, reducing memory usage by 60% and improving latency.  
• Developed and optimized multi-modal inference pipelines with advanced decoding strategies (Greedy, Beam, Sampling), improving token generation performance by 20%.


## Education
### Masters
University of North Texas

### Engineer's degree
Sree Venkateshwara College of Engineering


## Contact & Social
- LinkedIn: https://linkedin.com/in/yeshwanth-govindu
- Email: mailto:yeshwanth.g.us@gmail.com

---
Source: https://flows.cv/yeshwanth-govindu
JSON Resume: https://flows.cv/yeshwanth-govindu/resume.json
Last updated: 2026-04-18