Built and maintained backend services in Python supporting an enterprise analytics catalog and metadata platform used for data discovery and governance.
Developed internal automation to extract, normalize, and enrich metadata from source and target systems, increasing curated data objects by ~80% and reducing manual intervention.
Designed a LangChain-based internal service leveraging enterprise LLMs and PostgreSQL (AWS RDS) to analyze ETL pipeline telemetry and infer pipeline behavior, latency trends, and metadata gaps.
Implemented query strategies over OpenTelemetry-style schemas to support debugging and performance analysis of data pipelines.
Built and operated CI/CD pipelines using Jenkins and Terraform, provisioning EC2, Auto Scaling Groups, ALBs, and CloudWatch alerts, cutting deployment effort by 90% and accelerating releases by 75%.
Led remediation of infrastructure and pipeline security vulnerabilities, improving system resilience and audit readiness.