# Yeshwanth Govindu > Software Engineer | AI/ML Cloud Platform | Distributed Systems | Kubernetes, AWS, GCP | Kafka, Spark | LLM Inference | vLLM, Triton Profile: https://flows.cv/yeshwanth-govindu Software Engineer with 5+ years of experience specializing in AI/ML cloud platforms, distributed systems, and cloud-native architectures across AWS and GCP. Strong expertise in Kubernetes (EKS/GKE), microservices, and infrastructure-as-code with Terraform. Proven track record of building high-throughput data pipelines using Kafka and Spark, processing 500M+ messages/day, and reducing event propagation latency by 70%. Experienced in designing and deploying LLM inference pipelines using vLLM and Triton Inference Server on Kubernetes, improving system scalability and resource utilization. Core competencies include: - Cloud & DevOps: AWS (EKS, ECS, Lambda, CloudWatch), GCP (GKE, BigQuery, Stackdriver), Kubernetes, Docker, Terraform, CI/CD (Jenkins, GitLab) - Data Engineering: Apache Kafka, Spark (Batch & Streaming), RabbitMQ, ETL Pipelines, S3/ADLS, Snowflake, BigQuery - AI/ML: PyTorch, TensorFlow, LLM Fine-tuning (LoRA, SFT), vLLM, Triton Inference Server, NLP (BERT, GPT, LLaMA) - Databases: MySQL, PostgreSQL, Snowflake, BigQuery, Redis - Programming: Python, Java, C++, SQL I am passionate about building scalable, high-performance systems that handle hundreds of millions of events daily. Always open to connecting with like-minded engineers and exploring opportunities in AI/ML infrastructure, data engineering, and distributed systems. Location: Dallas, TX | Open to Remote and On-site opportunities ## Work Experience ### Software Engineer, AI/ML & Cloud Platform @ Prowesys Jan 2024 • Led cloud-native deployment of distributed systems on AWS and GCP, leveraging Kubernetes (EKS/GKE) to build scalable, highly available services across multi-region environments using Terraform. • Built and optimized event-driven data pipelines using Kafka and RabbitMQ, processing 500M+ messages/day with S3/ADLS backed storage, reducing event propagation latency by 70%. • Developed observability and monitoring solutions using Grafana, ELK, CloudWatch, and GCP Profiler, improving system visibility and reducing MTTR by 35% across production services. • Built and deployed LLM inference pipelines using vLLM and Triton Inference Server, improving system scalability and resource utilization across Kubernetes-based deployments. • Developed end-to-end LoRA and SFT fine-tuning pipelines for domain-specific models, improving compliance text classification accuracy using BigQuery and Snowflake feature engineering workflows. • Designed and deployed FastAPI-based ML services serving 1M+ daily requests, leveraging async processing and Redis caching to reduce p95 latency by 60%. ### Student Assistant @ University of North Texas Jan 2023 – Jan 2024 ### Software Engineer, Distributed AI Systems @ Tata Elxsi Jan 2020 – Jan 2023 | Bengaluru • Designed and deployed identity resolution microservices in Python/C++, unifying 50M+ Entra ID and HR records with versioned audit trails, self-healing workflows, and high availability. • Refactored a monolithic system into scalable microservices architecture, implementing contract testing and phased rollouts to improve system throughput by 45% without regressions. • Built high-throughput data pipelines using Kafka and Spark Streaming, processing 200M+ events/day with secure archival to S3/Glacier and compliance with FINRA/SEC retention policies. • Built and deployed LLM inference services using Triton Inference Server and vLLM, enabling scalable, production-grade model serving for enterprise workloads. • Optimized large language model performance (GPT, BERT, LLaMA) through quantization and efficient inference strategies, reducing memory usage by 60% and improving latency. • Developed and optimized multi-modal inference pipelines with advanced decoding strategies (Greedy, Beam, Sampling), improving token generation performance by 20%. ## Education ### Masters University of North Texas ### Engineer's degree Sree Venkateshwara College of Engineering ## Contact & Social - LinkedIn: https://linkedin.com/in/yeshwanth-govindu - Email: mailto:yeshwanth.g.us@gmail.com --- Source: https://flows.cv/yeshwanth-govindu JSON Resume: https://flows.cv/yeshwanth-govindu/resume.json Last updated: 2026-04-18