# Hitesh Wadhwa > Sr. Software Engineer @ Articul8 | Ex - LLM/RAG Researcher w/ Microsoft | MS CS @ UMass Amherst'24 Location: San Francisco Bay Area, United States Profile: https://flows.cv/hiteshwadhwa I’m a backend and platform engineer who likes building systems that actually hold up in production. The kind that scale, fail gracefully, and don’t wake someone up at 3am. Most of my work has been around distributed systems, orchestration, and automation. I enjoy taking messy or ambiguous problems and turning them into something reliable and boring in the best way possible. Clean execution paths, fewer edge cases, good observability, things that just work. I’m pretty stack-agnostic and tend to ramp quickly on whatever the problem needs. I care more about understanding how a system behaves end-to-end than any specific tool or framework. Outside of work, I like tinkering with side projects, automating random parts of my life, staying active, and generally learning new things just because they’re interesting. ## Work Experience ### Senior Software Engineer @ Articul8 AI Jan 2025 – Present | San Francisco Bay Area • Led development of a distributed agentic data ingestion platform on Ray, increasing throughput 6× and enabling fault-tolerant execution at scale. • Architected and launched AWS Marketplace integrations, owning registration, metering, billing, and IaC across API Gateway, Ory, SQS, and Lambda. • Designed the platform’s unified authentication and metering gateway using Kong and Ory Hydra, supporting secure multi-tenant workloads. • Modernized the orchestration engine with DLQs and retry mechanisms, reducing unrecoverable jobs by ~80% and improving system reliability. • Optimized Ray execution performance using Actors to eliminate bottlenecks and improve overall throughput. ### Software Engineer @ Articul8 AI Jan 2024 – Jan 2025 | San Francisco Bay Area • Developed core GraphQL APIs for Articul8 AI’s LLM platform, enhancing type-safe schemas and caching layers. • Built Knowledge Graph APIs for entity linking and semantic retrieval, significantly improving internal developer usage. • Implemented a robust notifications system, facilitating cross-team collaboration and debugging across various departments. ### Graduate Researcher - Large Language Models (LLMs) @ Microsoft Jan 2024 – Jan 2024 • Under review at EMNLP, researched on mechanistic interpertability of RAG models on LLMs and SLMs. • Built a RAG pipeline with Langchain Agents and Pinecone for context generation to see changes in parametric memory use. • Prompt Engineered with GPT-4 for synthetic data generation with quality assurance reducing manual efforts by ### Computer Science Grader @ University of Massachusetts Amherst Jan 2023 – Jan 2023 | Amherst, Massachusetts, United States ### Data Science Intern @ App Orchid Inc Jan 2023 – Jan 2023 | San Ramon, California, United States Utilized Machine Learning, Data Analysis, and NLP techniques to create training datasets, resulting in a 40% improvement in dataset quality and accuracy. • Conducted proof-of-concept (POC) projects with PyTorch and TensorFlow, evaluating 7 new models, and presented results to data science team, facilitating informed decision-making and driving innovation. ## Contact & Social - LinkedIn: https://linkedin.com/in/hitesh-wadhwa --- Source: https://flows.cv/hiteshwadhwa JSON Resume: https://flows.cv/hiteshwadhwa/resume.json Last updated: 2026-04-01