Current jobs related to Site Reliability Engineer - United Arab Emirates - Xenon7
-
Site Reliability Engineer
1 week ago
United Arab Emirates, Dubai ManpowerGroup Middle East Full timeJob Description Site Reliability Engineer / AMS Support Engineer - Digital Healthcare Our client, a leading global healthcare technology company, is looking for an experienced Site Reliability Engineer / Application Management Services (AMS) Support Engineer to join their innovative team. The company is at the forefront of digital health, partnering with...
-
Site Reliability Engineer
1 week ago
United Arab Emirates, Dubai Dicetek LLC Full timeJob Description Job Summary We are looking for a Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our production systems. The SRE will work closely with engineering, DevOps, and product teams to build highly available systems, automate operations, and improve system observability while maintaining service level...
-
Senior Site Reliability Engineer
2 weeks ago
Abu Dhabi, United Arab Emirates Astra Tech Full timeJob Description Role Summary We are looking for a Senior Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of botim's real-time communication and open platform infrastructure, supporting millions of active users globally. In this role, you will lead automation initiatives, operate and optimize large-scale Kubernetes...
-
Senior Site Reliability Engineer
3 days ago
, , United Arab Emirates Vng Solutions Full timeVSOL is a digital enabler with a mission to help public and private organizations evolve their businesses through data and technology. We provide an end-to-end service from consulting to execution that drives the growth and innovation of our clients. As VSOL is in a phase of rapid expansion, we offer a dynamic, creative environment that accelerates your...
-
Senior Site Reliability Engineer
1 week ago
united arab emirates Vng Solutions Full timeVSOL is a digital enabler with a mission to help public and private organizations evolve their businesses through data and technology. We provide an end-to-end service from consulting to execution that drives the growth and innovation of our clients. As VSOL is in a phase of rapid expansion, we offer a dynamic, creative environment that accelerates your...
-
Senior Site Reliability Engineer
3 days ago
, , United Arab Emirates Loft Orbital Solutions Full timeWanna join the adventure? Loft Orbital is revolutionizing access to space by building reliable, shareable satellites that drastically reduce the time and complexity traditionally required to get to orbit. We operate satellites, fly customer payloads, and handle entire missions from end-to-end. We’re a close-knit team of space enthusiasts, software experts,...
-
Senior Site Reliability Engineer
2 weeks ago
United Arab Emirates Xenon7 Full timeAbout us:Where elite tech talent meets world-class opportunitiesAt Xenon7, we work with leading enterprises and innovative startups on exciting, cutting-edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and on-demand resources...
-
Senior Engineer
22 hours ago
united arab emirates KBR Inc. Full timeSenior Engineer (Reliability) Mechanical (Rot & Static) / Instrumentation (I&C, F&G, Telecom) / Electrical Division KBR is looking for Senior Engineers to support the SMS Reliability Analysis Services Project, Adnoc, Abu Dhabi Summary Highly experienced professional who typically support the Reliability & Maintenance function by developing maintenance...
-
Site Reliability Engineer
2 weeks ago
, , United Arab Emirates Xenon7 Full timeDescription About us: Where elite tech talent meets world‑class opportunities! At Xenon7, we work with leading enterprises and innovative startups on exciting, cutting‑edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and...
-
Site Reliability Engineer
1 week ago
united arab emirates Xenon7 Full timeDescription About us: Where elite tech talent meets world‑class opportunities! At Xenon7, we work with leading enterprises and innovative startups on exciting, cutting‑edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and...
Site Reliability Engineer
2 weeks ago
Where elite tech talent meets world-class opportunities
At Xenon7, we work with leading enterprises and innovative startups on exciting, cutting-edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and on-demand resources allows us to partner with clients on transformative initiatives, driving innovation and business growth. Whether it's empowering global organizations or collaborating with trailblazing startups, we are committed to delivering advanced, impactful solutions that meet today's most complex challenges.
About the Client:Join one of Egypt's premier financial institutions, renowned for its extensive suite of banking services, including Institutional Banking, Personal Banking, and Islamic Banking. With a global presence through over 50 branches and correspondents, we serve a diverse and dynamic clientele. As we embark on a groundbreaking digital transformation journey, we are committed to leveraging the latest technologies to establish a state-of-the-art data architecture that will redefine our performance and service delivery.
Position OverviewThe Site Reliability Engineer (SRE) is responsible for ensuring the stability, performance, and reliability of Bank's critical applications, particularly Mobile Banking and Internet Banking platforms. This role bridges development and operations teams, implementing automation solutions, monitoring system health, and providing 24/7 operational support to maintain seamless banking services for customers on on-premise infrastructure.
Key Responsibilities· Monitor and maintain the reliability and performance of Mobile Banking and Internet Banking applications using Prometheus and Grafana dashboards
· Manage and support OpenShift/Kubernetes infrastructure for containerized banking applications on on-premise servers
· Respond to and resolve production incidents with minimal mean time to resolution (MTTR)
· Implement and maintain centralized logging solutions using ELK Stack (Elasticsearch, Logstash, Kibana) for application troubleshooting
· Develop and execute runbooks and automation scripts to reduce manual operational toil in OpenShift environments
· Provide 24/7 production support and on-call rotation for critical banking services
· Analyze logs and metrics from Prometheus and EFK to identify performance bottlenecks and reliability issues
· Conduct root cause analysis (RCA) on incidents and implement preventive measures
· Optimize Kubernetes/OpenShift deployments, pod management, and resource allocation on-premise
· Implement alerting strategies and threshold management in Prometheus and Grafana
· Support infrastructure scaling, capacity planning, and load balancing in production environments
· Implement security best practices and compliance requirements for financial systems in containerized environments
· Manage on-premise data center infrastructure and server resources
· Document operational procedures, troubleshooting guides, and create knowledge base articles
Qualifications· BSc in Computer Science, Information Technology, Software Engineering, or related field
· years of hands-on experience in SRE, DevOps, or Production Engineering roles
· Hands-on experience supporting production applications in Kubernetes/OpenShift environments
· Strong experience with OpenShift container platform administration and troubleshooting on on-premise infrastructure
· Proficiency with Prometheus for metrics collection and monitoring
· Proficiency with Grafana for dashboard creation and visualization
· Experience with ELK Stack (Elasticsearch, Logstash, Kibana) for centralized logging
· Strong understanding of Linux/Unix operating systems and networking fundamentals
· Practical experience with CI/CD tools and automation frameworks
· Proficiency in at least one programming/scripting language (Python, Go, or Bash)
· Experience with database management (SQL and NoSQL) on-premise
· Excellent troubleshooting and analytical skills for production support
· Strong communication skills and ability to work in cross-functional teams
· Experience in 24/7 production support environments
· Experience with on-premise data center infrastructure management
· Previous experience in financial services or banking sector is a plus