Developed full-stack features for an AI-powered platform utilizing Retrieval-Augmented Generation (RAG) to revolutionize content creation for presentations.
Implemented new features by building and enhancing the FastAPI backend, integrating OpenAI models through LlamaIndex, managing embeddings using PGVector (PostgreSQL), and fine-tuning prompts to optimize the relevance and clarity of LLM outputs. Additionally, developed the Next.js frontend using TypeScript and Tailwind CSS to create a seamless and user-friendly experience.
Boosted presentation generation speed by nearly 40% by parallelizing LLM calls using concurrent threading and optimizing the selection of LLM models to enhance performance while reducing costs.
Wrote end-to-end tests with Checkly and Playwright and integrated them into the CI/CD pipeline using GitHub Actions.