Current jobs related to Site Reliability Engineer - Dubai, Dubai - Discovered MENA

  • Reliability Engineer

    4 weeks ago


    Dubai, Dubai, United Arab Emirates Dox Reliability Konsult Full time

    Direct message the job poster from Dox Reliability Konsult I help plants implement strategies to reduce cost by 43.10% Company Description Dox Reliability Konsult is dedicated to helping businesses optimize operations and achieve full potential through maintenance best practice consulting, third-party recruitment, and personalized personnel training...


  • Dubai, Dubai, United Arab Emirates Discovered MENA Full time

    We're currently partnered with a leading technology consultancy who are scaling their tech team. They offer a diverse work environment that provides services in the UAE impacting millions of lives. We're currently helping them search for a Site Reliability Engineer to join their ever-growing team.Responsibilities:- Architect, implement, and oversee scalable,...


  • Dubai, Dubai, United Arab Emirates Discovered MENA Full time

    We're currently partnered with a leading technology consultancy who are scaling their tech team. They offer a diverse work environment that provides services in the UAE impacting millions of lives. We're currently helping them search for a Site Reliability Engineer to join their ever-growing team.Responsibilities:Architect, implement, and oversee scalable,...


  • Dubai, Dubai, United Arab Emirates Client of Discovered MENA Full time

    Job descriptionLocation: DubaiDuration: PermanentWe are currently partnered with a leading technology consultancy who are scaling their tech team. They offer a diverse work environment that provides services in the UAE impacting millions of lives. We're currently helping them search for a Site Reliability Engineer to join their ever-growing...


  • Dubai, Dubai, United Arab Emirates myZoi Full time

    Site Reliability Engineer (SRE)Dubai, United Arab Emirates | Posted on 04/04/2025myZoi is changing lives for the better for those who deserve it the most. We are an exciting fintech start-up aiming to promote financial inclusion globally. Our vision is to provide a level playing field to the unbanked and the underbanked in accessing essential financial...


  • Dubai, Dubai, United Arab Emirates myZoi Full time

    Site Reliability Engineer (SRE) Dubai, United Arab Emirates | Posted on 04/04/2025 myZoi is changing lives for the better for those who deserve it the most. We are an exciting fintech start-up aiming to promote financial inclusion globally. Our vision is to provide a level playing field to the unbanked and the underbanked in accessing essential financial...


  • Dubai, Dubai, United Arab Emirates beBee Careers Full time

    A Site Reliability Engineering Expert is needed to design and implement scalable AI and data infrastructure across cloud (AWS) platforms, ensuring high-performance and reliability.ResponsibilitiesImplement automation tools like Terraform and Ansible for provisioning, monitoring, and infrastructure optimization.Develop and maintain seamless CI/CD pipelines to...


  • Dubai, Dubai, United Arab Emirates beBee Careers Full time

    Senior Site Reliability EngineerJob SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. This individual will be responsible for designing, implementing, and maintaining cloud infrastructure using AWS and Collocated Data Center.The ideal candidate will have a strong background in software engineering, systems...


  • Dubai, Dubai, United Arab Emirates Thrive Learning Limited Full time

    About this role As a Site Reliability Engineer within the SRE team, you'll be focused on monitoring and supporting our AWS environments for platforms and tools utilised by our customers. The SRE team specialises in giving delivery squads visibility of the performance of their services in production and support to investigate and contain potential problems....


  • Dubai, Dubai, United Arab Emirates UMATR Full time

    Direct message the job poster from UMATR Senior Consultant at UMATR | AI, Data Science & Software Site Reliability Engineer - Principal - Dubai (must already be based in the UAE) We've partnered with an impactful, well-funded start-up, based in Dubai to help them grow their team. We're looking to hire an exceptional Principal SRE, someone accustomed to...

Site Reliability Engineer

2 weeks ago


Dubai, Dubai, United Arab Emirates Discovered MENA Full time
Job Description

Site Reliability Engineer (SRE)

Location: Dubai

Duration: Permanent

We're currently partnered with a leading technology consultancy who are scaling their tech team. They offer a diverse work environment that provide services in the UAE impacting millions of lives. We're currently helping them search for a Site Reliability Engineer to join their ever growing team.

Responsibilities:

- Architect, implement, and oversee scalable, high-performance AI and data infrastructure across cloud (AWS) and on-prem environments.
- Utilise automation tools (e.g., Terraform, Ansible) for provisioning, monitoring, and infrastructure optimisation.
- Design robust monitoring, alerting, and logging solutions to detect and mitigate potential failures before they impact operations.
- Develop and maintain seamless CI/CD pipelines to accelerate the deployment of AI models and data-driven applications.
- Optimise workflows to enhance efficiency, reduce deployment friction, and maintain system stability.
- Partner with AI researchers, data engineers, and developers to align infrastructure with project needs.
- Act as a bridge between AI, data, and infrastructure teams, ensuring smooth communication and technical alignment.
- Rapidly diagnose and resolve system incidents, conducting thorough root-cause analyses to prevent future issues.
- Establish and refine disaster recovery frameworks to safeguard AI and data assets.
- Implement stringent security protocols to protect AI and data infrastructure, ensuring compliance with industry regulations.
- Perform regular security evaluations, proactively addressing vulnerabilities.
- Identify opportunities to improve system scalability, efficiency, and resilience.
- Stay ahead of emerging trends in AI infrastructure, site reliability engineering, and cloud technologies.

Qualifications & skills:

- Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
- 3-5 years experience in a similar role
- Experience with on-premise and cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes, Docker).
- Experience with AI and data-specific infrastructure (e.g., GPU clusters, data lakes)
- Understanding of machine learning frameworks and data processing tools (e.g., TensorFlow, PyTorch, Apache Spark).