# Mario Tinoco > Staff Software Engineer | AI-Native Systems | UCLA Alum Location: Davis, California, United States Profile: https://flows.cv/mariotinoco I build at the intersection of cloud infrastructure, edge computing, and AI. My work on personalized cloud caching architecture led to the development of a patented system for differential static and dynamic caching on edge computing platforms (US Patent No. 12,332,964). Additionally, in my career, I have helped a VC-backed startup scale from concept to $3 MM ARR in its first year, applying lean cloud architectures and rapid experimentation. With over three years of experience in AI-assisted development, I have enhanced engineering velocity and creativity by utilizing tools such as GitHub Copilot, Cursor, and prompt-driven workflows. I lead cross-functional teams in integrating LLMs for generative personalization, agent-based task orchestration, RAG systems, and eval pipelines where models act as judges, orchestrators, and cost-reward models. Core Skills: Edge Computing • Cloud-Native Architecture • LLM Integration • Generative AI • Fusion AI • Multi-Agent Systems • RAG Pipelines • Kubernetes • AWS • Pulumi • Redis • CI/CD • MLOps • Python • TypeScript • Java • System Design 🧠 AI-Augmented Builder: Expert in designing feedback loops, eval pipelines, and cost-reward models for LLMs 🌐 Patent Contributor: Co-inventor of a differential edge caching system patented in 2025 ⚙️ Early Adopter: Leveraging Copilot and Cursor to supercharge engineering velocity 📊 Strategist: Skilled in hybrid-agent routing, embeddings, token streaming, and vector search architecture “Perfection is achieved, not when there is nothing more to add, but when there is nothing left to take away.” – Antoine de Saint-Exupéry ## Work Experience ### Staff Software Engineer @ interface.ai Jan 2024 – Jan 2026 | California, United States Architected low-latency, real-time voice AI systems and frameworks for autonomous agent collaboration and task execution. Optimized high-throughput telemetry using ClickHouse for real-time feedback loops; streamlined delivery via AI-generated code under strict human-reviewed architecture standards. Integrated GraphRAG to enhance context accuracy and response density across agentic pipelines. Engineered real-time streaming voice responses over SSE and WebSockets; built multi-turn context-aware NLP for high-performance interactions. Designed LLM evaluation frameworks for automated output scoring and regression testing; implemented prompt versioning and model routing to reduce inference costs. Built AI safety guardrails — PII redaction, content filtering, and anomaly detection — across production agentic pipelines. ### Principal Software Engineer, Infrastructure Architect @ Nostra AI Jan 2022 – Jan 2024 Architected, designed, and implemented the world’s first edge delivery engine for e-commerce. This included speed performance as well as edge A/B testing, real-time personalization and data engineering to support the platform. Based on AWS (RDS, Redshift, Fargate, Lambda, DynamoDB) and Cloudflare (Workers, R2, and CDN/Load Balancing). Built initial MVP with a team of 3 in just 2 months and then scaled revenue to 2.5MM ARR 10 months later. ✏️ Project Highlights - Through infrastructure-as-code (IAC) implemented and operated the cloud infrastructure and software for a terabyte-scale multi-master redis cluster for the in-memory global edge caches. Managed highly-available container clusters with thousands of container instances for scalable data mining, and edge accelerating content. Implemented petabyte-scale data-service pipelines into both S3 and a Redshift. Managed the transactional data modeling for domain services. Developed multiple ETL workflows, spanning end-to-end from edge capture to data visualization. Developed python notebooks for BI and for data visualization for our customers. Collaborated with statistics PhD to develop and release predictive models (Bayesian). Improved reliability through proactive monitoring and alerting. ### Lead Software Engineer @ Sun Nuclear Corporation Jan 2020 – Jan 2022 Implemented a multi-tenant, cloud-native platform for the existing on-premise radiotherapy application. Refactored a monolith .NET web application to cloud microservices deployed on Kubernetes (EKS) using infrastructure-as-code (Pulumi) through GitHub Actions and Ansible. Worked with medical physicists and mechanical engineers to develop CUDA models to assist in evaluating 3D cancer imaging for anomalies such as over-radiation. ✏️ Project Highlights - containerization of GPU workloads allowed for hybrid deployments onto on-prem GPUs as well as AWS EKS for new SaaS offering and elastic GPU capacity for cost-effective scaling. CICD workflows to create ephemeral development and testing environments, and to validate model updates to FDA compliance through continuous testing. Developed governance-as-code tools in GitHub workflows. Integrated telemetry collectors with time-series Prometheus for Grafana ops dashboards. ### Senior Software Engineer @ Williams-Sonoma, Inc. Jan 2018 – Jan 2020 | Sacramento, California Area Developed full-stack features (Java/Typescript) for the checkout on the e-commerce platform of the Williams-Sonoma brands (Pottery Barn, Mark & Graham, West Elm). ✏️ Project Highlights - Lead the frontend redesign to a single-page-app, and integration into analytics system, where A/B testing showed the acceleration resulted in a 10% increase in revenue for a multi-billion dollar checkout. ## Education ### Bachelor’s in Computer Science in Computer Science UCLA Jan 2014 – Jan 2018 ### Associate Degree in Mathematics in Mathematics Sacramento City College Jan 2011 – Jan 2013 ## Contact & Social - LinkedIn: https://linkedin.com/in/mario-tinoco --- Source: https://flows.cv/mariotinoco JSON Resume: https://flows.cv/mariotinoco/resume.json Last updated: 2026-03-22