# Punam P.

> Full Stack AI Engineer | LLM & RAG Systems | Java Spring Boot | FastAPI, React, AWS, LangChain | MLOps & Cloud-Native AI | Currently @ Morgan Stanley

Location: San Jose, California, United States

Profile: https://flows.cv/punam

I am a Full Stack Engineer and AI Engineer with 6+ years of experience building scalable, cloud-native applications and production-grade AI systems. I specialize in designing end-to-end platforms that combine strong backend architectures (microservices, RESTful APIs, asynchronous processing, database design, caching, and authentication/authorization) with modern frontend frameworks and advanced AI/ML capabilities.

Currently, I work on LLM-powered applications and intelligent data systems, developing Retrieval-Augmented Generation (RAG) pipelines, vector search platforms, and full-stack AI dashboards. On the backend, this includes building API gateways, service-to-service communication, background workers, and scalable inference services that improve speed, accuracy, and decision-making in enterprise environments. I have hands-on experience taking AI products from idea to production, with a strong focus on system reliability, security, performance, and observability.

My background spans financial services, healthcare, and insurance, where I’ve built backend-heavy systems such as real-time data pipelines, secure and rate-limited APIs, event-driven services, and intelligent automation tools under strict compliance and regulatory frameworks.

I bring a balanced skill set across backend engineering, frontend development, cloud infrastructure, and MLOps. Key areas of expertise include backend technologies such as FastAPI, Node.js, Express, RESTful API design, microservices architecture, PostgreSQL, Redis, OAuth 2.0, JWT, and asynchronous job processing, along with LLMs, RAG architectures, React, TypeScript, AWS, Kubernetes, MLOps, and scalable system design.
I am passionate about building backend-driven, intelligent, user-focused systems that solve real business problems at scale.

## Work Experience

### Software Engineer – AI Platforms @ Morgan Stanley

Jan 2025 – Present | California, United States

- Architected a secure RAG-based financial document intelligence platform using Java Spring Boot, Node.js, Python, LangChain, and Weaviate, applying microservices architecture, event-driven design, and modular service boundaries to process compliance reports and product documentation, reducing analyst research time by 60%+ across risk and compliance teams.
- Developed a full-stack AI dashboard using ReactJS, TypeScript, and TailwindCSS with Node.js REST APIs, enabling analysts to review LLM outputs and feedback signals, improving prompt accuracy and response relevance by 28% within six weeks.
- Designed scalable Node.js and Python microservices using Express and FastAPI to orchestrate retrieval pipelines, document embedding services, and asynchronous AI inference workflows across internal research systems.
- Implemented a secure Node.js API gateway with OAuth 2.0 authentication and request validation, supporting more than 18K monthly AI requests across enterprise document intelligence workloads.
- Deployed containerized Spring Boot, Node.js, and Python services on AWS EKS using Docker, scaling inference workloads to handle 3× traffic growth while maintaining average response latency under 420 ms.

### Senior Software Engineer @ Apexon

Jan 2022 – Jan 2023 | Pune, Maharashtra, India

- Engineered a clinical trial oversight analytics platform using React, TypeScript, Redux Toolkit, and Java and Node.js APIs, enabling research teams to monitor enrollment progress, site performance, and protocol compliance across 15+ clinical study sites, improving reporting turnaround by 35%.
- Built interactive clinical data dashboards using React, D3.js, and GraphQL services, enabling investigators to analyze adverse events and protocol deviations across 200K+ trial records, improving operational visibility for clinical operations teams.
- Optimized clinical analytics data pipelines using React Query with Node.js and Spring Boot microservices, reducing redundant API calls by 40% and improving dashboard response time during high-volume clinical reporting cycles.
- Improved dashboard performance using React memoization, component virtualization, and API caching, reducing page load latency by 38% for complex clinical monitoring dashboards.
- Implemented secure role-based access control workflows using OAuth 2.0 authentication and Node.js authorization services, ensuring compliant access to sensitive healthcare datasets for 200+ clinical investigators and administrators.

### Product Developer @ Hexaware Technologies

Jan 2020 – Jan 2022 | Pune, Maharashtra, India

- Developed an insurance policy servicing platform for LV.com using React, TypeScript, Java, and Node.js microservices, enabling agents to manage quotes, endorsements, and policy lifecycle workflows across 50K+ policy records, improving servicing efficiency by 32%.
- Implemented real-time underwriting validation services using Node.js rule evaluation engines and modular React UI components, automating eligibility checks and reducing manual underwriting review effort by 28% during high-volume policy enrollment cycles.
- Integrated Elasticsearch document search with GraphQL and REST APIs, enabling agents to retrieve claim guidelines and regulatory documents across 1M+ insurance policy records within seconds.
- Designed event-driven policy update workflows using Node.js asynchronous processing, enabling real-time synchronization across customer, billing, and policy systems processing 20K+ daily policy updates.
- Optimized application responsiveness using lazy loading, indexed queries, and API performance tuning, reducing UI latency by 34% across high-traffic insurance servicing dashboards.

### Software Developer @ Primal Infosys

Jan 2019 – Jan 2020 | Pune, Maharashtra, India

- Built an e-commerce customer support and order management platform using React, JavaScript (ES6), and Node.js services, enabling users to manage profiles, orders, and support tickets across 25K+ monthly customer interactions, improving support resolution turnaround by 30%.
- Engineered RESTful backend services using Node.js, Express, and PostgreSQL, supporting real-time order updates, ticket creation, and account management workflows processing 8K+ daily transactions.
- Designed order support routing and prioritization workflows using Node.js business logic services and asynchronous processing, reducing support backlog and improving ticket assignment accuracy by 25%.
- Integrated React front-end interfaces with Node.js APIs and asynchronous data pipelines, enabling real-time updates across customer accounts, order status, and service history dashboards.
- Optimized platform performance using API caching, indexed database queries, and efficient React state management, reducing dashboard response time by 36% for frequently accessed e-commerce support tools.

## Education

### Master's degree in Computer Science

University of the Pacific

### Bachelor's degree in Computer Engineering

Viit

## Contact & Social

- LinkedIn: https://linkedin.com/in/punam-p
- GitHub: https://github.com/PUNAMRAMPUKALE?tab=repositories

---

Source: https://flows.cv/punam

JSON Resume: https://flows.cv/punam/resume.json

Last updated: 2026-04-17