Led the implementation of Observability for SAP SuccessFactors, leveraging a cutting-edge technology stack and collaborating with various cloud-native vendors to innovate and customize observability agents tailored to SAP JVM requirements. Designed and deployed an observability framework driven by OpenTelemetry standards, integrating ElasticSearch APM, Zabbix, SAP JVM tuning, and OpenTelemetry agent solutions for both remote and local setups.
Developed microservices-focused observability dashboards aligned with SRE golden signals to enhance monitoring and performance visibility. Conducted JVM tuning and applied machine learning/analytics to observability data to predict potential failures and enable automated remediation, significantly reducing Mean Time to Recovery (MTTR).