Software Engineer with 5+ years of experience in building scalable backend and distributed systems. Experienced in designing high-throughput, low-latency services using Python, Kafka, Kubernetes, and AWS.
Experience
2024 — Now
2024 — Now
• Built a scalable recommendation backend using Python, FastAPI, and Django, serving 2M+ users with <200ms p95 latency at 10k+ RPS.
• Deployed services on Kubernetes-based distributed infrastructure, enabling auto-scaling, high availability, and fault tolerance.
• Developed hybrid recommendation systems (collaborative + content-based) with LLM embeddings and vector search (FAISS, Pinecone), improving cold-start accuracy by 35%
• Designed and delivered RESTful APIs and async GraphQL services with 99.99% uptime under high traffic
• Optimized PostgreSQL with sharding, indexing, and Redis caching to handle 5M+ daily events efficiently
• Built CI/CD pipelines (GitHub Actions, Jenkins) for automated testing, containerization, and Kubernetes deployments
• Created ETL pipelines using PySpark and SQL to process large-scale user interaction data
• Implemented real-time streaming with Kafka and Spark Streaming for instant recommendation updates
• Developed async systems with Celery, RabbitMQ, and Kafka for background processing and model retraining
• Used MLflow for A/B testing and model tracking, improving CTR, retention, and engagement
• Achieved 95%+ test coverage using PyTest for reliable deployments
• Designed highly scalable systems with rate limiting, retries, circuit breakers, handling up to 50k RPS
• Set up monitoring and observability (Prometheus, Grafana, ELK) and built dashboards using React & TypeScript
• Ensured secure and compliant systems with OAuth2, JWT, GDPR, and SOC2 standards
2020 — 2023
2020 — 2023
• Engineered and scaled internal AI developer tools using Python, streamlining model development, testing, and deployment
workflows, improving developer productivity by 35%.
• Built and productionized RESTful APIs and GraphQL services using FastAPI and Flask, enabling flexible, real-time AI/ML inference and seamless integration across internal financial platforms.
• Architected high-performance async and distributed systems using FastAPI, Celery, and RabbitMQ, supporting low-latencyinference and scalable background processing for compute-intensive workloads.
• Designed scalable microservices architecture and leveraged Flask for rapid prototyping, accelerating feature development cycles and system modularity.
• Developed end-to-end ML pipelines and reusable SDKs covering data preprocessing, feature engineering, model training, evaluation, benchmarking, and experiment tracking (MLflow), improving reproducibility and experimentation speed.
• Implemented LLM-powered features (prompt engineering, embeddings, semantic search) and built RAG pipelines, enabling context-aware, intelligent responses over large-scale document corpora.
• Deployed scalable AI/ML systems on AWS (S3, EC2, Lambda, API Gateway), optimizing for cost, performance, and highavailability in cloud-native environments.
• Designed and implemented secure, scalable backend architectures with OAuth2/JWT authentication, rate limiting, caching, and horizontal scaling to support high-throughput financial systems.
• Containerized applications using Docker and orchestrated deployments with Kubernetes, enabling auto-scaling, fault tolerance, and high availability.
• Built and optimized data storage and caching layers using PostgreSQL, MongoDB, and Redis, ensuring low-latency access and high-throughput data processing.
• Processed large-scale financial datasets using Apache Spark and PySpark and integrated real-time streaming with Kafka, enabling continuous ingestion and near real-time analytics.
Education
Saint Louis University