Site Reliability Engineer

2 days ago


Dubai, Dubai, United Arab Emirates Discovered MENA Full time

Get AI-powered advice on this job and more exclusive features.

Head of Software Engineering at Discovered MENA - Voted the region's best new talent provider

Site Reliability Engineer (SRE)

Location: Dubai

Duration: Permanent

We're currently partnered with a leading technology consultancy who are scaling their tech team. They offer a diverse work environment that provides services in the UAE impacting millions of lives. We're currently helping them search for a Site Reliability Engineer to join their ever-growing team.

Responsibilities:

  • Architect, implement, and oversee scalable, high-performance AI and data infrastructure across cloud (AWS) and on-prem environments.
  • Utilise automation tools (e.g., Terraform, Ansible) for provisioning, monitoring, and infrastructure optimisation.
  • Design robust monitoring, alerting, and logging solutions to detect and mitigate potential failures before they impact operations.
  • Develop and maintain seamless CI/CD pipelines to accelerate the deployment of AI models and data-driven applications.
  • Optimise workflows to enhance efficiency, reduce deployment friction, and maintain system stability.
  • Partner with AI researchers, data engineers, and developers to align infrastructure with project needs.
  • Act as a bridge between AI, data, and infrastructure teams, ensuring smooth communication and technical alignment.
  • Rapidly diagnose and resolve system incidents, conducting thorough root-cause analyses to prevent future issues.
  • Establish and refine disaster recovery frameworks to safeguard AI and data assets.
  • Implement stringent security protocols to protect AI and data infrastructure, ensuring compliance with industry regulations.
  • Perform regular security evaluations, proactively addressing vulnerabilities.
  • Identify opportunities to improve system scalability, efficiency, and resilience.
  • Stay ahead of emerging trends in AI infrastructure, site reliability engineering, and cloud technologies.

Qualifications & Skills:

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
  • 3-5 years of experience in a similar role.
  • Experience with on-premise and cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes, Docker).
  • Experience with AI and data-specific infrastructure (e.g., GPU clusters, data lakes).
  • Understanding of machine learning frameworks and data processing tools (e.g., TensorFlow, PyTorch, Apache Spark).
Seniority Level

Mid-Senior level

Employment Type

Full-time

Job Function

Engineering and Information Technology

#J-18808-Ljbffr

  • Dubai, Dubai, United Arab Emirates Canonical Full time

    Roles and responsibilities A Site Reliability Engineer (SRE) is responsible for ensuring that a company's systems, services, and infrastructure are reliable, scalable, and efficient. The role is a hybrid between software engineering and operations, with an emphasis on improving the reliability and performance of services through automation, monitoring, and...


  • Dubai, Dubai, United Arab Emirates Canonical Full time

    Roles and responsibilitiesASite Reliability Engineer (SRE) isresponsible for ensuring that a company's systems,services, and infrastructure are reliable, scalable, and efficient.The role is a hybrid between software engineering and operations,with an emphasis on improving the reliability and performance ofservices through automation, monitoring, and...


  • Dubai, Dubai, United Arab Emirates Canonical Full time

    Roles and responsibilitiesASite Reliability Engineer (SRE) isresponsible for ensuring that a company's systems,services, and infrastructure are reliable, scalable, and efficient.The role is a hybrid between software engineering and operations,with an emphasis on improving the reliability and performance ofservices through automation, monitoring, and...


  • Dubai, Dubai, United Arab Emirates Canonical Full time

    Roles and responsibilities ASite Reliability Engineer (SRE) isresponsible for ensuring that a company's systems,services, and infrastructure are reliable, scalable, and efficient.The role is a hybrid between software engineering and operations,with an emphasis on improving the reliability and performance ofservices through automation, monitoring, and...


  • Dubai, Dubai, United Arab Emirates Dice Full time

    Our team is seeking a highly skilled Site Reliability Engineer to support critical API Platform, DevOps, and other activities for the Digital Services Group.This role will involve providing consulting services for improved system stability, availability, performance, and reliability.Key responsibilities include:Providing input into the resolution of...


  • Dubai, Dubai, United Arab Emirates bhft Full time

    BHFT is a proprietary algorithmic trading firm. Our team manages the full trading cycle, from software development to creating and coding strategies and algorithms.Our trading operations cover key exchanges. The firm trades across a broad range of asset classes, including equities, equity derivatives, options, commodity futures, rates futures, etc. We employ...


  • Dubai, Dubai, United Arab Emirates Exinity Group Full time

    In the fast-growing economies of the world, there's a new generation of ambitious younger people eager to gain financial independence. And they're turning to the world's financial markets to achieve it. Exinity's mission is to empower them to succeed. We design, engineer and market a growing range of innovative trading and investing products that meet their...


  • Dubai, Dubai, United Arab Emirates Exinity Group Full time

    In the fast-growing economies of the world, there's a new generation of ambitious younger people eager to gain financial independence. And they're turning to the world's financial markets to achieve it. Exinity's mission is to empower them to succeed. We design, engineer and market a growing range of innovative trading and investing products that meet their...


  • Dubai, Dubai, United Arab Emirates Investsky Full time

    About UsWe strive to create a seamless investing experience.Our aim is to provide a comprehensive investment solution for MENA investors.We believe that investing should be easy, efficient, and enjoyable.Job Description:Design and implement reliable systems and processes with the development team.Analyze platform performance to identify and resolve...

  • Reliability Engineer

    3 weeks ago


    Dubai, Dubai, United Arab Emirates Najmaconsultancy Full time

    We are looking for a Reliability Engineer for an aluminium production and refinery company. The ideal candidate will play a crucial role in enhancing business processes, developing maintenance strategies, and supporting our maintenance managers in reliability analysis.ResponsibilitiesEnhances business processes.Develops maintenance strategies and...


  • Dubai, Dubai, United Arab Emirates Exinity Group Full time

    In the fast-growing economies of the world, there's a new generation of ambitious younger people eager to gain financial independence. And they're turning to the world's financial markets to achieve it. Exinity's mission is to empower them to succeed. We design, engineer and market a growing range of innovative trading and investing products that meet their...


  • Dubai, Dubai, United Arab Emirates Najmaconsultancy Full time

    We are looking for a Reliability Engineer for an aluminium production and refinery company. The ideal candidate will play a crucial role in enhancing business processes, developing maintenance strategies, and supporting our maintenance managers in reliability analysis. Responsibilities Enhances business processes. Develops maintenance strategies and...


  • Dubai, Dubai, United Arab Emirates Najmaconsultancy Full time

    We are looking for a Reliability Engineer for an aluminium production and refinery company. The ideal candidate will play a crucial role in enhancing business processes, developing maintenance strategies, and supporting our maintenance managers in reliability analysis. Responsibilities Enhances business processes. Develops maintenance strategies and...


  • Dubai, Dubai, United Arab Emirates Dice Full time

    Mandatory SkillsKubernetes, Java Api, Cloud Services, DevopsToolsOptionalSkills Aws, Agile Scrum, ApiGatewayClienttelecommunications practice is looking for dynamic and drivenprofessionals to join a rapidly growing high-performanceteam.Our clientis a leading provider of digital Global System for MobileCommunications/General Packet Radio Service (GSM/GPRS)...


  • Dubai, Dubai, United Arab Emirates bhft Full time

    Site Reliability Engineer ResponsibilitiesAt BHFT, the Site Reliability Engineer plays a pivotal role in ensuring the reliability and performance of our trading platform. Key tasks include:Ensuring the continuous compliance of our platform with external regulatory requirements and internal standards.Developing and refining monitoring and alerting systems to...

  • Reliability Engineer

    18 hours ago


    Dubai, Dubai, United Arab Emirates Amazon Full time

    About the RoleWe are seeking a skilled Maintenance Technician to join our Reliability Maintenance Engineering team. As a key member of our team, you will be responsible for ensuring the safe operation of equipment within our warehouses and delivery network.Promote safe working practices and adhere to Amazon safety standards.Perform planned preventative...


  • Dubai, Dubai, United Arab Emirates Discovered MENA Full time

    Mid-Senior Level PositionWe're seeking a qualified Site Reliability Engineer to join our team and develop scalable AI and data infrastructure.The successful candidate will be responsible for architecting, implementing, and overseeing high-performance infrastructure solutions for AI and data applications.Key RequirementsBachelor's or Master's degree in...

  • Site Engineer

    1 week ago


    Dubai, Dubai, United Arab Emirates Estemarat Group Full time

    Dubai, United Arab Emirates | Posted on 02/24/2025 We are seeking a highly skilled and detail-oriented Site Engineer to oversee construction site operations, ensure project execution as per plans, and maintain quality and safety standards. The ideal candidate should have experience in site supervision, project coordination, and compliance with UAE...


  • Dubai, Dubai, United Arab Emirates Jobtrack Management Services Full time

    Safety and Reliability EngineerWe are seeking a highly skilled Safety and Reliability Engineer to join our team at Jobtrack Management Services.The successful candidate will be responsible for providing safety and loss prevention information and support, including interpretations of codes, standards, and practices on existing plants, facility modifications,...

  • Site Engineer

    1 week ago


    Dubai, Dubai, United Arab Emirates Skills Hub Recruitment Solutions Full time

    Oversee day-to-day operations at the project site to ensure smooth execution of Civil and MEP (Mechanical, Electrical, Plumbing) works. Ensure work is being completed according to the projects timeline, budget, and quality standards. Supervise subcontractors, laborers, and other site personnel to ensure work is performed effectively and safely. Site...