Senior Site Reliability Engineer Ii

3 months ago


مصر, Egypt Careem Full time

Egypt

Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million Captains, simplified the lives of over 50 million customers, and built a platform for the region’s best talent to thrive and for entrepreneurs to scale their businesses. Careem operates in over 70 cities across 10 countries, from Morocco to Pakistan.

**About the team**:
We are looking for engineers who will work within the Cloud Engineering team. The team develops and maintains cloud-native technology for the Careem Service teams:

- Highly scalable Kubernetes clusters
- Cloud Access management automation and integration with k8s

**About the role**:
As an SRE, you’ll need to solve problems that arise using empirical data, teamwork, and your own unique expertise.

The Data Platform SRE will work directly with our data platform and engineering teams in an embedded SRE model, operating in unison with the developers to deliver seamless experiences for our customers.

**Key responsibilities include**:

- Make an impact from design phase, through development and operation of Data Platform over Kubernetes cluster and its ecosystem on AWS
- Build core services, and tooling and create technical processes that simplify and enable engineers across multiple services
- Identifying, automating and scaling system configurations without compromising on security and reliability.
- Participate in on-call rotations and help improve incident response

**Education and Experience**:
BS/MS in Computer Science or Equivalent (7+ years of software development or production operations experience in a large-scale environment)

**Qualifications**:

- Strong sense of ownership and integrity demonstrated through clear communication and collaboration
- Experience in architecting, developing, operating, and troubleshooting Kubernetes clusters and/or other highly available systems at scale.
- Proficiency with the architecture, deployment, performance tuning, and troubleshooting of open-source data analytics technologies, especially Apache Spark, Trino and related software in a large-scale environment
- The ability to design, author, and release code in languages like Go, Python, or Java
- Acute drive to automate manual operations and to improve them through repeated iteration
- Understanding of the Linux Operating System, standard networking protocols, and components
- Experience with cloud-native services on AWS/GCP
- Hands-on experience managing large numbers of diverse systems with configuration management or software delivery platforms (such as Terraform, Cloudformation, ArgoCD, and Flux)
- Excellent troubleshooting and problem-solving skills
- Experience with scale testing, disaster recovery, and capacity planning
- Effective communication and collaboration skills: have the ability to drive and promote technical partnerships across teams
- Incident response and/or incident management experience

**What we’ll provide you**

We offer colleagues the opportunity to drive impact in the region while they learn and grow. As a full time Careem colleague, you will be able to:

- Work and learn from great minds by joining a community of inspiring colleagues.
- Put your passion to work in a purposeful organisation dedicated to creating impact in a region with a lot of untapped potential.
- Explore new opportunities to learn and grow every day.
- Work 4 days a week in office & 1 day from home, and remotely from any country in the world for 30 days a year with unlimited vacation days per year.
- Access to healthcare benefits and fitness reimbursements for health activities including gym, health club, and training classes.



  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Convertedin Full time

    **About us**: Convertedin is a marketing operating system for e-Commerce. It utilizes data and shoppers' insights to create personalized multi-channel marketing that boosts customer engagement and maximizes their return on their marketing budget by leveraging artificial intelligence capabilities. Convertedin has helped more than 800 e-Commerce...

  • Site Reliability

    3 months ago


    مصر, Egypt ASWAT Full time

    **We Are Hiring: Site Reliability and DevOps Engineer** **About Us**: **ZIWO** is an Omni-channel Cloud Contact Center Software (CCAAS) providing straightforward solutions for companies to communicate with their clients via Phone, WhatsApp, SMS, and more. We connect 145 countries globally, including the GCC, enabling users to instantly expand their reach...


  • مصر, Egypt Convertedin Full time

    **About us**: Convertedin is a marketing operating system for e-Commerce. It utilizes data and shoppers' insights to create personalized multi-channel marketing that boosts customer engagement and maximizes their return on their marketing budget by leveraging artificial intelligence capabilities. Convertedin has helped more than 800 e-Commerce...


  • مصر, Egypt Convertedin Full time

    **About us**: Convertedin is a marketing operating system for e-Commerce. It utilizes data and shoppers' insights to create personalized multi-channel marketing that boosts customer engagement and maximizes their return on their marketing budget by leveraging artificial intelligence capabilities. Convertedin has helped more than 800 e-Commerce...


  • مصر, Egypt IBM Full time

    Introduction management tasks more efficiently. We’re seeking skilled, automation-focused Network Engineers to maintain and administer the Power Virtual Server Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations. The Network Infrastructure Operations Site Reliability Engineer works with clients to...


  • مصر, Egypt IBM Full time

    Introduction management tasks more efficiently. We’re seeking skilled, automation-focused Network Engineers to maintain and administer the Power Virtual Server Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations. The Network Infrastructure Operations Site Reliability Engineer works with clients to...

  • Reliability Engineer

    3 months ago


    مصر, Egypt PepsiCo Full time

    **Responsibilities**: - Key stakeholder in delivering PEMM results for the Maintenance Support Department. - Will have ownership of the Reliability section of the site maintenance improvement plan (MIAP) coming from every PeMM assessment. - Lead the site Asset Reliability program. Own and develop the site Major Incident Report (MIR), Analytical Problem...


  • مصر, Egypt IBM Full time

    Introduction management tasks more efficiently. We’re seeking skilled, automation-focused Network Engineers to maintain and administer the Power Virtual Server Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations. The Network Infrastructure Operations Site Reliability Engineer works with clients to...


  • مصر, Egypt Careem Full time

    Careem is building ‘the everything app’ for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Qoyod Full time

    Job Summary As Site Reliability/DevOps Engineer, you will introduce processes, tools, and methodologies to balance needs throughout the software development life cycle, from coding and deployment to maintenance and updates. **Responsibilities**: - Focus on improving the scalability, robustness, and automation of our tools and processes, as well as...


  • مصر, Egypt Careem Full time

    Careem is building ‘the everything app’ for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...

  • Reliability Engineer

    3 months ago


    مصر, Egypt Si-Ware Systems Full time

    We are seeking a highly motivated and detail-oriented Reliability Engineer to join our team. As a Reliability Engineer, you will play a crucial role in ensuring the reliability and durability of our products. You will collaborate with cross-functional teams to identify potential issues, design and execute tests, and analyze data to provide valuable insights...


  • مصر, Egypt Souq.com for E-Commerce LLC - G32 Full time

    3+ years of non-internship professional software development experience - 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience programming with at least one software programming language Amazon Middle East and North Africa team is looking for a Software Development...


  • مصر, Egypt Top Business Human Resources Full time

    **Job Description**: As a System Integration Engineer, you will play a crucial role in both; the building and running phases of our projects. You will be responsible for installing, configuring, and integrating solutions. Additionally, you will provide technical support for Cloud, On-Premises, and Hybrid services, ensuring the reliability, performance, and...


  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Vodafone Full time

    **Role purpose**: **Key accountabilities and decision ownership**: - Use software as a tool to manage systems, solve problems, and automate resolution to achieve zero touch operations - Design and enhance software architecture to improve scalability, service reliability, capacity, and performance - Defines, creates, promotes and monitors SLO’s/SLI’s...


  • مصر, Egypt Vodafone Full time

    **Role Purpose**: **Key Accountabilities and Decision Ownership**: - Use software as a tool to manage systems, solve problems, and automate resolution to achieve zero touch operations - Design and enhance software architecture to improve scalability, service reliability, capacity, and performance. - Defines, creates, promotes and monitors SLO’s/SLI’s...