Site Reliability Engineer

5 days ago


Dubai, Dubai, United Arab Emirates noon Full time
is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs.

noon operates without boundaries; we are aggressively and voraciously ambitious. Starting in 2017 with , the region's homegrown e-commerce platform and leading online shopping destination, noon is now a digital ecosystem of products and services - noon, noon Food, Noon in Minutes, NowNow, SIVVI, noon One, and noon Pay.

At noon we have the courage to pursue what seems impossible, we work hard to get things done, we go to great lengths to ensure that the experience of everyone from our customers to our sellers or noon Bandidos is stellar but above all, we are grateful for the opportunities we have. If you feel the above values resonate with you – you will enjoy this incredible journey with us

Job Description

As a Site Reliability Engineer (SRE) at noon payments, you will play a crucial role in maintaining and enhancing the reliability, availability, and performance of our cloud-based infrastructure and services.

You will be responsible for automating deployments, optimizing systems, and ensuring seamless performance across our platforms. This position requires a strong foundation in cloud infrastructure management, particularly with Azure - AKS and GCP-GKE, alongside hands-on experience with Azure DevOps and monitoring tools like Datadog.

You will:

1. Cloud Infrastructure Management: Manage and optimize cloud environments across Azure and GCP, ensuring efficient resource utilization, high system availability, and scalability (AKS-GKE).
2. Infrastructure as Code: Utilize Terraform for infrastructure provisioning, ensuring consistent and scalable deployments, and managing infrastructure via Azure DevOps pipelines.
3. Configuration Management: Implement and manage system configurations using Ansible to ensure consistency and streamline updates across different environments.
4. Continuous Integration/Continuous Deployment (CI/CD): Develop, maintain, and optimize CI/CD pipelines within Azure DevOps to automate testing and deployment processes, reducing time from development to production.
5. Monitoring and Observability: Set up and maintain comprehensive monitoring and observability solutions using Datadog to track system health, performance, and proactively detect issues.
6. Container Orchestration: Deploy, manage, and optimize Kubernetes clusters to support scalable and resilient application deployments.
7. Incident Management: Participate in a 24/7 on-call or roster-based team to respond to incidents, conduct root cause analysis, and implement solutions to minimize downtime and ensure system reliability.
8. Performance Tuning: Continuously monitor system performance, identify bottlenecks, and implement optimizations to improve efficiency and response times.
9. Capacity Planning: Plan and manage system capacity to ensure resources meet current and future demands, enabling seamless service delivery.
10. Collaboration: Work closely with Network Operations Center (NOC) and DevOps teams to troubleshoot issues, optimize deployment processes, and drive continuous improvement.
11. Documentation: Create and maintain detailed documentation for system configurations, deployment processes, and incident reports.

Skill Requirements

1. Bachelor's degree in computer science, Information Technology or any other related discipline or equivalent related experience.
2. Cloud, ITIL, CKA certifications are a plus.
3. 6+ years of directly related or relevant experience, preferably in information security.
4. Extensive experience with cloud platforms such as Azure, GCP, and Huawei Cloud.
5. Proficiency with Terraform for infrastructure automation and Ansible for configuration management.
6. Hands-on experience with Kubernetes for container orchestration mainly AKS and GKE.
7. Expertise in monitoring and observability tools such as Datadog.
8. Familiarity with Azure VMSS, GCP MIG for virtual machine scaling and management.
9. Experience in a 24/7 on-call or roster-based team environment, focusing on system uptime and incident response.
10. Strong understanding of SRE processes and best practices for system reliability, availability, and performance.
11. Excellent problem-solving skills and the ability to handle complex technical issues under pressure.
12. Effective communication skills and a collaborative approach to working with diverse teams.
13. Experience with payment gateway projects or similar high-transaction systems is preferred.
14. Additional knowledge in advanced monitoring techniques, performance tuning, and capacity planning is a plus.

Who will excel?

We're looking for candidates who thrive in a fast-paced, dynamic start-up environment. We're searching for problem solvers, people who operate with a bias for action and have a deep understanding of the importance of resourcefulness over reliance.

Candor is our only default. Demanding unequivocal high standards should be non-negotiable because quality matters. We want people who are radically candid, cohorts who commit to settling for nothing but the best - in hiring, in accepting work from colleagues, and in your own work.

Ours is not an easy mission, but it is a meaningful one. Every hire must actively raise the bar of talent in the company to help us reach our vision.

Seniority level

Mid-Senior level

Employment type

Full-time

Job function

Industries: Internet Marketplace Platforms

Referrals increase your chances of interviewing at noon by 2x.

#J-18808-Ljbffr

  • Dubai, Dubai, United Arab Emirates Canonical Full time

    Roles and Responsibilities1. A Site Reliability Engineer (SRE) is responsible for ensuring that a company's systems, services, and infrastructure are reliable, scalable, and efficient. The role is a hybrid between software engineering and operations, with an emphasis on improving the reliability and performance of services through automation, monitoring, and...


  • Dubai, Dubai, United Arab Emirates Canonical Full time

    Roles and ResponsibilitiesA Site Reliability Engineer (SRE) is responsible for ensuring that a company's systems, services, and infrastructure are reliable, scalable, and efficient. The role is a hybrid between software engineering and operations, with an emphasis on improving the reliability and performance of services through automation, monitoring, and...


  • Dubai, Dubai, United Arab Emirates noon Full time

    As a Site Reliability Engineer at noon payments, you will be responsible for automating deployments, optimizing systems, and ensuring seamless performance across our platforms. This position requires a strong foundation in cloud infrastructure management, particularly with Azure - AKS and GCP-GKE, alongside hands-on experience with Azure DevOps and...


  • Dubai, Dubai, United Arab Emirates Canonical Full time

    Roles and responsibilitiesASite Reliability Engineer (SRE) isresponsible for ensuring that a company's systems,services, and infrastructure are reliable, scalable, and efficient.The role is a hybrid between software engineering and operations,with an emphasis on improving the reliability and performance ofservices through automation, monitoring, and...


  • Dubai, Dubai, United Arab Emirates Canonical Full time

    Job OverviewA Site Reliability Engineer at Canonical ensures the company's systems, services, and infrastructure are reliable, scalable, and efficient. This role combines software engineering and operations to improve reliability and performance through automation, monitoring, and proactive issue resolution.


  • Dubai, Dubai, United Arab Emirates bhft Full time

    BHFT is a proprietary algorithmic trading firm. Our team manages the full trading cycle, from software development to creating and coding strategies and algorithms.Our trading operations cover key exchanges. The firm trades across a broad range of asset classes, including equities, equity derivatives, options, commodity futures, rates futures, etc. We employ...


  • Dubai, Dubai, United Arab Emirates noon Full time

    noon.com is a technology leader with a simple mission: to be the best place to buy and sell things. In doing this we hope to accelerate the digital economy of the Middle East, empowering regional talent and businesses to meet the full range of consumers' online needs.noon operates without boundaries; we are aggressively and voraciously ambitious. Starting in...


  • Dubai, Dubai, United Arab Emirates Dice Full time

    Mandatory Skills: Kubernetes, Java API, Cloud Services, DevOps ToolsOptional Skills: AWS, Agile Scrum, API GatewayClient telecommunications practice is looking for dynamic and driven professionals to join a rapidly growing high-performance team.Our client is a leading provider of digital Global System for Mobile Communications/General Packet Radio Service...


  • Dubai, Dubai, United Arab Emirates Dice Full time

    Mandatory Skills: Kubernetes, Java API, Cloud Services, DevOps Tools Optional Skills: AWS, Agile Scrum, API Gateway Client telecommunications practice is looking for dynamic and driven professionals to join a rapidly growing high-performance team. Our client is a leading provider of digital Global System for Mobile Communications/General Packet...


  • Dubai, Dubai, United Arab Emirates Q-Express Documents Transport Full time

    Job Overview:We are seeking a Reliability Engineering Lead to play a critical role in managing operational building facility services across multiple sites and countries.The ideal candidate will have 5+ years' experience in facilities management, including both hard and soft services, vendor management in large and complex 24/7 operational facilities, and...


  • Dubai, Dubai, United Arab Emirates Exinity Group Full time

    In the fast-growing economies of the world, there's a new generation of ambitious younger people eager to gain financial independence. And they're turning to the world's financial markets to achieve it. Exinity's mission is to empower them to succeed. We design, engineer and market a growing range of innovative trading and investing products that meet their...


  • Dubai, Dubai, United Arab Emirates ValueLabs Full time

    Direct message the job poster from ValueLabs Senior Executive Human Resources at ValueLabs We are seeking an innovative and experienced DevOps Engineer to support Advance Analytics COE Squads. Please find the below Job Description The Senior Technology Engineer will be responsible for managing deliverables of the squad which includes building and...


  • Dubai, Dubai, United Arab Emirates etihad Full time

    SynopsisThe Maintenance Program & Reliability Engineer plays a key role in ensuring compliance with GCAA CAR M regulations by developing, analyzing, and optimizing aircraft maintenance programs. This role focuses on enhancing fleet reliability by identifying trends, implementing technical solutions, and driving continuous improvements in maintenance...


  • Dubai, Dubai, United Arab Emirates etihad Full time

    SynopsisThe Maintenance Program & Reliability Engineer plays a key role in ensuring compliance with GCAA CAR M regulations by developing, analyzing, and optimizing aircraft maintenance programs. This role focuses on enhancing fleet reliability by identifying trends, implementing technical solutions, and driving continuous improvements in maintenance...


  • Dubai, Dubai, United Arab Emirates Exinity Group Full time

    In the fast-growing economies of the world, there's a new generation of ambitious younger people eager to gain financial independence. And they're turning to the world's financial markets to achieve it. Exinity's mission is to empower them to succeed. We design, engineer and market a growing range of innovative trading and investing products that meet their...


  • Dubai, Dubai, United Arab Emirates Exinity Group Full time

    In the fast-growing economies of the world, there's a new generation of ambitious younger people eager to gain financial independence. And they're turning to the world's financial markets to achieve it. Exinity's mission is to empower them to succeed. We design, engineer and market a growing range of innovative trading and investing products that meet their...

  • Reliability Engineer

    3 weeks ago


    Dubai, Dubai, United Arab Emirates Najmaconsultancy Full time

    We are looking for a Reliability Engineer for an aluminium production and refinery company. The ideal candidate will play a crucial role in enhancing business processes, developing maintenance strategies, and supporting our maintenance managers in reliability analysis. Responsibilities Enhances business processes. Develops maintenance strategies and...


  • Dubai, Dubai, United Arab Emirates Najmaconsultancy Full time

    We are looking for a Reliability Engineer for an aluminium production and refinery company. The ideal candidate will play a crucial role in enhancing business processes, developing maintenance strategies, and supporting our maintenance managers in reliability analysis.Responsibilities1. Enhances business processes.2. Develops maintenance strategies and...


  • Dubai, Dubai, United Arab Emirates Najmaconsultancy Full time

    We are looking for a Reliability Engineer for an aluminium production and refinery company. The ideal candidate will play a crucial role in enhancing business processes, developing maintenance strategies, and supporting our maintenance managers in reliability analysis.ResponsibilitiesEnhances business processes.Develops maintenance strategies and...


  • Dubai, Dubai, United Arab Emirates Najmaconsultancy Full time

    We are looking for a Reliability Engineer for an aluminium production and refinery company. The ideal candidate will play a crucial role in enhancing business processes, developing maintenance strategies, and supporting our maintenance managers in reliability analysis.ResponsibilitiesEnhances business processes.Develops maintenance strategies and...