New York, New York, United States
• Designed and implemented end-to-end data pipelines to aggregate and normalize geospatial risk datasets from multiple public sources, creating address-level features used in underwriting analysis.
• Built and optimized ML-based risk scoring models (CatBoost, XGBoost), improving regional hazard score accuracy by 15% and enabling transparent, auditable underwriting decisions.
• Designed a Retrieval-Augmented Generation (RAG) pipeline to produce grounded, explainable risk narratives, reducing manual review time for underwriters.
• Built a FastAPI service to deliver JSON risk reports with configurable parameters and error handling.