San Jose, California, United States
• Built a Document-AI pipeline for staffing packets (offers, SOWs, POs, background letters) using OCR + LayoutLM/Donut with rule fallbacks, cutting median processing time by 60% and enabling 53% straight-through processing (STP).
• Orchestrated end-to-end Airflow workflows (ingest, classify, extract, validate, write back) with field accuracy above 98% across 25 key fields.
• Built a validation engine (Pydantic + Great Expectations) enforcing business rules (rate within SOW, start/end dates, approvals), plus OpenSearch for clause/field search and full auditability.
• Stood up monitoring for model/data drift, accuracy by doc type, reviewer-edit deltas, and SLA dashboards, enabling proactive quality and reliability management.