Ask Rino
Ask Rino is an AI companion allowing Amazonians within Devices and Services to operate on sensitive financial data and automate repeated workflows.
Ask Rino has been used by 400+ finance managers at Amazon across 41,000+ conversations, featured at internal FinTech conferences, actively pushed by the VP (Tobias Straub) into cross-org adoption, and referenced by the Amazon CFO (Brian Olsavsky). Now a VP-level Kingpin goal with a 100% onboarding target.
As the highest code contributor, I led Ask Rino from 0-1 and built core infrastructure that powers it:
• Architected one of the first remote MCP server infrastructures at Amazon, a centralized gateway connecting finance services directly into the AI platform. 2 MCP servers onboarded, 175+ tool invocations in 3 months.
• Extended the MCP protocol to a novel third transport layer to allow Ask Rino to securely operate on users' local systems (think OpenClaw).
• Built agentic text-to-SQL workflows that write, self-correct, validate, and execute queries on sensitive financial databases autonomously. Backed by an eval stack with 93%+ accuracy across 93 test queries.
• Secured Ask Rino's citation sources and links with CloudFront Signer allowing restricted access within Amazon. Supports 9 file types for user upload and chat.
• Solved a critical ThrottlingException error due to Claude's rate limits by proposing a novel model cycling method that scaled throughput to 600 requests per minute, bypassing the need for VP/SVP-level provisioned capacity approval.
Query API
• Built the Query API for blending calculations across monthly and lifetime device cost reports - 2,100+ calls in its first 4 months, now central to reporting and downstream services.
• Optimized latency by 98%, reducing deepest-level lineage calculation from 33.5s to 546ms.