Tech: Java, Go, Python, Spring, PostgreSQL, Elasticsearch (ELK), Kafka/Event Streaming, Terraform, Kubernetes, Cloud APIs.
Distributed Observability & Telemetry Platform:
Architected and built a customer-facing, distributed observability platform for real-time ingestion, processing, and fan-out delivery of high-volume metrics and events across global infrastructure.
•Designed scalable telemetry pipelines that handle high-cardinality data and tolerate partial failures through retries and backpressure.
•Implemented multi-sink fan-out delivery to heterogeneous consumers including Datadog, Splunk, PagerDuty, and custom webhooks.
•Owned API design, data models, and reliability guarantees for event streaming and subscription workflows.
•Led integration of CloudEvents-based schemas for extensibility and interoperability.
🔗 https://docs.equinix.com/observability/
🔗 https://github.com/equinix/equinix-cloudevents
Autonomous & Intelligent Operations (AI/ML Initiatives):
Contributed to early-stage intelligent alerting and automation initiatives, integrating ML models and agents to improve signal quality, anomaly detection, and operational efficiency.
•Partnered with data science teams on model-driven alerting pipelines.
•Designed system interfaces to support future autonomous control loops for infrastructure operations.
Distributed Control Plane – Fabric Cloud Router (FCR):
Led the development of core backend services powering FCR, a globally distributed, multi-cloud platform and one of Equinix’s fastest-growing products.
•Built and scaled control-plane microservices responsible for lifecycle management, state reconciliation, and API-driven provisioning.
•Designed idempotent APIs, data models, and workflows supporting multi-cloud connectivity across AWS, GCP, and Azure.
•Led development of Terraform providers and external-facing APIs to enable programmatic adoption by customers.
🔗 https://docs.equinix.com/en-us/Content/Interconnection/FCR/FCR-intro.htm