Passionate about distributed large scale systems, systems reliability, and writing code.
Experience
2025 — Now
London, United Kingdom
2021 — 2025
2021 — 2025
London, United Kingdom
Co-drove the creation of a Go CLI tool for improving and safely executing daily infrastructure operations:
• Top contributor to the project. Assisted other teams in developing new features and migrating legacy tools.
• Contributed to establishing the tool as the de-facto standard for all SRE infrastructure operations.
• Leading a Roadmap for building a robust framework and improving many key components.
Led the infrastructure work on a new edge service written in Go to improve edge resiliency:
◦ Worked on the project from the POC phase until production rollout in less than a year.
◦ Helped with reviewing the service code and reducing deployment time by 90%
Led efforts to build the first GCP production environment on Kubernetes:
◦ Worked with stake holders from different teams on building and testing the new environment.
◦ Cross teams efforts on integrating internal tooling.
◦ Collaborated with the QA team to ensure infrastructure and product readiness before going live.
Enhancements to the disaster recovery processes:
◦ Mentored an intern in creating a Go tool reducing the DR process from several hours to one hour.
◦ Various collaborations with Eng teams on addressing new requirements while ensuring safe and fast failover.
2019 — 2021
2019 — 2021
London, United Kingdom
Contributed to a cross-team project to move monolith apps from AWS EC2 to a Dockerized ECS architecture:
◦ Delivered the service mesh cluster using Consul and Envoy, and ensured production readiness.
◦ Planned and executed the service mesh cluster upgrade without downtime to +10 clusters.
Started a project to decouple two customer facing load balancer layers:
◦ The decoupling allows faster maintenance and scaling operations up to 6 times.
◦ Drove the initial architecture design and worked with the stake holders on addressing the requirements of the project.
2017 — 2019
2017 — 2019
Berlin, Germany
• Worked with Eng teams on various improvements to a microservice architecture of around 150 services running on AWS ECS.
• Created and tested Python tools for improving various infrastructure operations.
• Planned and executed major improvements on Cassandra, Kafka, Elasticsearch and Zookeeper clusters without down time.
• Improved observability for different teams by delivering critical application and load balancer metrics.
• Contributions to design and build the first production Kubernetes cluster on GKE and Azure.
2016 — 2017
2016 — 2017
Egypt
• Driving the infrastructure work on a data pipeline for tracking users events and reporting them to the business unit:
◦ Kafka, Elasticsearch and Cassandra on Kubernetes were used in the stack.
◦ Collaboration with the Eng team to design the data model and run benchmarking tests on the infrastructure.
• Created various Python tools for
Education
Mansoura University