Site Reliability Engineer

3 weeks ago


مصر, Egypt Convertedin Full time

**About us**:
Convertedin is a marketing operating system for e-Commerce. It utilizes data and shoppers' insights to create personalized multi-channel marketing that boosts customer engagement and maximizes their return on their marketing budget by leveraging artificial intelligence capabilities. Convertedin has helped more than 800 e-Commerce worldwide.

**About the job**

We are looking for a Site Reliability Engineer to join our team and develop software systems and automated solutions for operational aspects in an organization.

Site Reliability Engineer responsibilities include monitoring computer systems and building alerts for various operational issues that computer systems can experience.

Ultimately, you will work with our IT team to ensure our organization can continue to deliver products and services in our computer system environment.

**Responsibilities**:

- Participate in system design consulting, platform management, and capacity planning
- Partner with engineering teams to improve services through rigorous testing and release procedures.
- Run the production environment by monitoring availability and taking an in-depth view of system health.
- Build monitoring that alerts on symptoms/issues rather than on outages.
- Debug production issues across services and levels of the hosting stack.
- Improve reliability, quality, and time-to-market of our suite of software solutions.
- Design, build, plan, and maintain core infrastructure that enables ConvertedIn scaling to support thousands of concurrent users.
- Create sustainable systems and services through adaptation of automation.
- Balance feature development speed and reliability with well-defined service-level objectives.
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual enhancement.
- Be on an on-call rotation to respond to incidents that impact ConvertedIn services availability.
- Document all the things so you don’t need to learn the same thing twice.
- Deliver root cause analysis and corrective actions for reported incidents

**Requirements**:

- 3+ years of experience as a devops engineer
- Good command of english
- Configuration management: Ansible to manage our infrastructure hosting.
- Infrastructure as code: use Terraform and Github actions for automation, and leverage cloud technologies to meet our goals.
- Systems: manage, configure and troubleshoot operating system issues, storage (block and object), networking (VPCs, proxies and CDNs), and operating system (Linux) configuration, package management, startup and troubleshooting.
- Cloud services: Cloud resources provisioning and configuration through CLI/API
- Solid understanding of the software development lifecycle.
- Monitoring and instrumentation: implement metrics in Prometheus, Grafana, log management and related system, and third-parties integrations like (slack, ,datadog, sentry, pagerduty)
- Strong Knowledge of managing CI/CD tools
- Engineering practices: availability, reliability and scalability, as well as disaster recovery.
- Work in a variety of languages: Shell, Python

**Benefits**
- Work from Anywhere, yes this remote base opportunity with the availability of hybrid options.
- We love flexibility Manage your own work schedule. We trust the people we hire.
- Competitive package.
- Paid time off.
- Fun benefits & perks that you will enjoy
- Trusting, ego-free and truth-seeking team members
- See something you want to improve? Awesome. We’re a flexible and collaborative team that is always learning and growing.
- Need more convincing? Check this out
- Convertedin is an equal employment opportunity employer such that all qualified applicants will receive consideration for employment without regard to race, color, age, religion, sex, sexual orientation, gender identity/expression, national origin or disability.


  • Site Reliability

    1 week ago


    مصر, Egypt ASWAT Full time

    **We Are Hiring: Site Reliability and DevOps Engineer** **About Us**: **ZIWO** is an Omni-channel Cloud Contact Center Software (CCAAS) providing straightforward solutions for companies to communicate with their clients via Phone, WhatsApp, SMS, and more. We connect 145 countries globally, including the GCC, enabling users to instantly expand their reach...


  • مصر, Egypt DXC Technology Full time

    **Job brief** We are looking for a Juinor Site Reliability Engineer for our Early Grads Program. **Responsibilities**: - Set up, operate and manage environments. - Handle code deployments in all environments. - Operate over all types of infrastructures like on-prem cloud and/or container-based platforms - Develop automation with a focus on scalability,...


  • مصر, Egypt IBM Full time

    Introduction management tasks more efficiently. We’re seeking skilled, automation-focused Network Engineers to maintain and administer the Power Virtual Server Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations. The Network Infrastructure Operations Site Reliability Engineer works with clients to...


  • مصر, Egypt IBM Full time

    Introduction management tasks more efficiently. We’re seeking skilled, automation-focused Network Engineers to maintain and administer the Power Virtual Server Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations. The Network Infrastructure Operations Site Reliability Engineer works with clients to...

  • Reliability Engineer

    2 weeks ago


    مصر, Egypt PepsiCo Full time

    **Responsibilities**: - Key stakeholder in delivering PEMM results for the Maintenance Support Department. - Will have ownership of the Reliability section of the site maintenance improvement plan (MIAP) coming from every PeMM assessment. - Lead the site Asset Reliability program. Own and develop the site Major Incident Report (MIR), Analytical Problem...


  • مصر, Egypt IBM Full time

    Introduction management tasks more efficiently. We’re seeking skilled, automation-focused Network Engineers to maintain and administer the Power Virtual Server Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations. The Network Infrastructure Operations Site Reliability Engineer works with clients to...


  • مصر, Egypt Convertedin Full time

    **About us**: Convertedin is a marketing operating system for e-Commerce. It utilizes data and shoppers' insights to create personalized multi-channel marketing that boosts customer engagement and maximizes their return on their marketing budget by leveraging artificial intelligence capabilities. Convertedin has helped more than 800 e-Commerce...


  • مصر, Egypt Convertedin Full time

    **About us**: Convertedin is a marketing operating system for e-Commerce. It utilizes data and shoppers' insights to create personalized multi-channel marketing that boosts customer engagement and maximizes their return on their marketing budget by leveraging artificial intelligence capabilities. Convertedin has helped more than 800 e-Commerce...


  • مصر, Egypt Qoyod Full time

    Job Summary As Site Reliability/DevOps Engineer, you will introduce processes, tools, and methodologies to balance needs throughout the software development life cycle, from coding and deployment to maintenance and updates. **Responsibilities**: - Focus on improving the scalability, robustness, and automation of our tools and processes, as well as...


  • مصر, Egypt Evolvice Full time

    Evolvice is a German nearshore service provider with branches in Egypt and Ukraine. Founded in 2012, Evolvice has a strong technical background and business domain knowledge, combining software engineering and Agile methodology, leading its’ clients’ path to digital transformation. Headquartered in the heart of the automobile industry, Stuttgart...

  • Reliability Engineer

    2 weeks ago


    مصر, Egypt Si-Ware Systems Full time

    We are seeking a highly motivated and detail-oriented Reliability Engineer to join our team. As a Reliability Engineer, you will play a crucial role in ensuring the reliability and durability of our products. You will collaborate with cross-functional teams to identify potential issues, design and execute tests, and analyze data to provide valuable insights...


  • مصر, Egypt VMware Full time

    **Why will you enjoy this new opportunity?** **Success in the Role: What are the performance outcomes over the first 6-12 months you will work toward completing?** **The Work: What type of work will you be doing? What assignments, requirements, or skills will you be performing on a regular basis?** Your regular activities may be modified to suit your...


  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Procore Technologies Full time

    **Job Description**: We’re looking for a **Senior Site Reliability Engineer** to join Procore’s Fintech cloud infrastructure team. In this role, you’ll work collaboratively with software engineers, software testing engineers, and product/project managers, to build, design, and shape the cloud infrastructure. Your role will also include improving and...


  • مصر, Egypt Procore Full time

    We’re looking for a **Senior Site Reliability Engineer** to join Procore’s Fintech cloud infrastructure team. In this role, you’ll work collaboratively with software engineers, software testing engineers, and product/project managers, to build, design, and shape the cloud infrastructure. Your role will also include improving and developing new platform...


  • مصر, Egypt Careem Full time

    Careem is building ‘the everything app’ for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Careem Full time

    Careem is building ‘the everything app’ for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Vodafone Full time

    **Role Purpose**: **Key Accountabilities and Decision Ownership**: - Use software as a tool to manage systems, solve problems, and automate resolution to achieve zero touch operations - Design and enhance software architecture to improve scalability, service reliability, capacity, and performance. - Defines, creates, promotes and monitors SLO’s/SLI’s...