San Francisco, California, United States
Languages: Rust, Python, Java, TypeScript, WASM
Distributed Systems & Data: Kafka, Pulsar, Flink, Spark, ArangoDB, Redis, PostgreSQL, Parquet, Pub/Sub
Infrastructure: AWS, GCP, Kubernetes, Docker, CI/CD, OpenTelemetry, Prometheus
Backend & Ops: High-throughput REST/gRPC/GraphQL APIs, caching/batching, latency optimization
Leadership: Architecture design, platform engineering, cross-team collaboration, mentoring
Key Contributions:
• Led a multi-region data replication system handling 5M+ messages/sec, ensuring consistency at scale.
• Built and deployed a Rust-based Function-as-a-Service platform supporting Rust/JavaScript functions, CLI tools, and config management—boosting developer productivity and extensibility.
• Engineered multi-language SDKs (JavaScript/TypeScript), CLI tools, and sandbox environments to reduce onboarding time by 50%—enhancing developer productivity and satisfaction.
• Head LLM-driven observability tooling using LangChain, automating metric analysis and performance tuning in production environments.
• Designed high-throughput REST/gRPC APIs delivering real-time data globally with sub-100ms latency.
• Led design of distributed, transactional payment workflows using ArangoQL and CEP, ensuring reliability, consistency, and compliance across multi-region deployments.
• Integrated OpenTelemetry + Prometheus for real-time observability and alerts; 99.99% uptime.