Mid / Senior Site Reliability Engineer

4 weeks ago


مصر, Egypt Convertedin Full time

**About us**:
Convertedin is a marketing operating system for e-Commerce. It utilizes data and shoppers' insights to create personalized multi-channel marketing that boosts customer engagement and maximizes their return on their marketing budget by leveraging artificial intelligence capabilities. Convertedin has helped more than 800 e-Commerce worldwide.

**About the job**

Your role will involve the creation of software systems and automated solutions to address operational needs within Convertedin.

Your duties will encompass overseeing computer systems and establishing alert mechanisms to address a range of operational challenges that our computer systems may encounter.

Ultimately, you will collaborate with our Infrastructure and Security team to guarantee the seamless delivery of products and services within Convertedin's computer system environment.

**Responsibilities**:

- Participate in system design consulting, platform management, and capacity planning
- Partner with engineering teams to improve services through rigorous testing and release procedures.
- Run the production environment by monitoring availability and taking an in-depth view of system health.
- Build monitoring that alerts on symptoms/issues rather than on outages.
- Debug production issues across services and levels of the hosting stack.
- Improve reliability, quality, and time-to-market of our suite of software solutions.
- Design, build, plan, and maintain core infrastructure that enables ConvertedIn scaling to support thousands of concurrent users.
- Create sustainable systems and services through adaptation of automation.
- Balance feature development speed and reliability with well-defined service-level objectives.
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual enhancement.
- Be on an on-call rotation to respond to incidents that impact ConvertedIn services availability.
- Document all the things so you don’t need to learn the same thing twice.
- Deliver root cause analysis and corrective actions for reported incidents

**Requirements**:

- 2-5 years of experience as any of the following, Site Reliability Engineer, DevOps Engineer, System Engineer, Infrastructure Engineer, Cloud Support Engineer.
- Good command of English language.
- Configuration management: Ansible to manage our infrastructure hosting.
- Infrastructure as code: Use Terraform and GitHub actions for automation, and leverage cloud technologies to meet our goals.
- Systems: manage, configure and troubleshoot operating system issues, storage (block and object), networking (VPCs, proxies and CDNs), and operating system (Linux) configuration, package management, startup and troubleshooting.
- Cloud services: Cloud resources provisioning and configuration through CLI/API
- Solid understanding of the software development lifecycle.
- Monitoring and instrumentation: implement metrics in Prometheus, Grafana, log management and related system, and third-parties integrations like (slack, ,datadog, sentry, pagerduty)
- Strong Knowledge of managing CI/CD tools
- Engineering practices: availability, reliability and scalability, as well as disaster recovery.
- Work in a variety of languages: Shell, Python

**Benefits**
- Work from Anywhere, yes this remote base opportunity with the availability of hybrid options.
- We love flexibility Manage your own work schedule. We trust the people we hire.
- Competitive package.
- Paid time off.
- Fun benefits & perks that you will enjoy
- Trusting, ego-free and truth-seeking team members
- See something you want to improve? Awesome. We’re a flexible and collaborative team that is always learning and growing.
- Need more convincing? Check this out
- Convertedin is an equal employment opportunity employer such that all qualified applicants will receive consideration for employment without regard to race, color, age, religion, sex, sexual orientation, gender identity/expression, national origin or disability.



  • مصر, Egypt Convertedin Full time

    **About us**: Convertedin is a marketing operating system for e-Commerce. It utilizes data and shoppers' insights to create personalized multi-channel marketing that boosts customer engagement and maximizes their return on their marketing budget by leveraging artificial intelligence capabilities. Convertedin has helped more than 800 e-Commerce...


  • مصر, Egypt Convertedin Full time

    **About us**: Convertedin is a marketing operating system for e-Commerce. It utilizes data and shoppers' insights to create personalized multi-channel marketing that boosts customer engagement and maximizes their return on their marketing budget by leveraging artificial intelligence capabilities. Convertedin has helped more than 800 e-Commerce...


  • مصر, Egypt Evolvice Full time

    Evolvice is a German nearshore service provider with branches in Egypt and Ukraine. Founded in 2012, Evolvice has a strong technical background and business domain knowledge, combining software engineering and Agile methodology, leading its’ clients’ path to digital transformation. Headquartered in the heart of the automobile industry, Stuttgart...

  • Site Reliability

    1 week ago


    مصر, Egypt ASWAT Full time

    **We Are Hiring: Site Reliability and DevOps Engineer** **About Us**: **ZIWO** is an Omni-channel Cloud Contact Center Software (CCAAS) providing straightforward solutions for companies to communicate with their clients via Phone, WhatsApp, SMS, and more. We connect 145 countries globally, including the GCC, enabling users to instantly expand their reach...


  • مصر, Egypt Procore Technologies Full time

    **Job Description**: We’re looking for a **Senior Site Reliability Engineer** to join Procore’s Fintech cloud infrastructure team. In this role, you’ll work collaboratively with software engineers, software testing engineers, and product/project managers, to build, design, and shape the cloud infrastructure. Your role will also include improving and...


  • مصر, Egypt Procore Full time

    We’re looking for a **Senior Site Reliability Engineer** to join Procore’s Fintech cloud infrastructure team. In this role, you’ll work collaboratively with software engineers, software testing engineers, and product/project managers, to build, design, and shape the cloud infrastructure. Your role will also include improving and developing new platform...


  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt DXC Technology Full time

    **Job brief** We are looking for a Juinor Site Reliability Engineer for our Early Grads Program. **Responsibilities**: - Set up, operate and manage environments. - Handle code deployments in all environments. - Operate over all types of infrastructures like on-prem cloud and/or container-based platforms - Develop automation with a focus on scalability,...


  • مصر, Egypt IBM Full time

    Introduction management tasks more efficiently. We’re seeking skilled, automation-focused Network Engineers to maintain and administer the Power Virtual Server Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations. The Network Infrastructure Operations Site Reliability Engineer works with clients to...


  • مصر, Egypt IBM Full time

    Introduction management tasks more efficiently. We’re seeking skilled, automation-focused Network Engineers to maintain and administer the Power Virtual Server Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations. The Network Infrastructure Operations Site Reliability Engineer works with clients to...

  • Reliability Engineer

    2 weeks ago


    مصر, Egypt PepsiCo Full time

    **Responsibilities**: - Key stakeholder in delivering PEMM results for the Maintenance Support Department. - Will have ownership of the Reliability section of the site maintenance improvement plan (MIAP) coming from every PeMM assessment. - Lead the site Asset Reliability program. Own and develop the site Major Incident Report (MIR), Analytical Problem...


  • مصر, Egypt IBM Full time

    Introduction management tasks more efficiently. We’re seeking skilled, automation-focused Network Engineers to maintain and administer the Power Virtual Server Cloud Infrastructure-as-a-Service environment and provide reliable and secure network operations. The Network Infrastructure Operations Site Reliability Engineer works with clients to...


  • مصر, Egypt TrianglZ Full time

    TrianglZ is hiring Mid-Senior iOS Engineer! Tasks **Requirements**: - 2+ years of iOS development experience in Swift - Solid understanding of the full mobile development - Strong experience in iOS mobile app development (Swift, Cocoa Touch) - Experience with third-party libraries and APIs (UIKit with Autolayout, Alamofire, Firebase, RestAPI, Websockets,...


  • مصر, Egypt Careem Full time

    Careem is building ‘the everything app’ for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Qoyod Full time

    Job Summary As Site Reliability/DevOps Engineer, you will introduce processes, tools, and methodologies to balance needs throughout the software development life cycle, from coding and deployment to maintenance and updates. **Responsibilities**: - Focus on improving the scalability, robustness, and automation of our tools and processes, as well as...


  • مصر, Egypt Careem Full time

    Careem is building ‘the everything app’ for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Careem Full time

    Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...


  • مصر, Egypt Procore Technologies Full time

    **Job Description**: We’re looking for a **Senior SRE Engineer** to join Procore’s Fintech cloud infrastructure team. In this role, you’ll work collaboratively with software engineers, software testing engineers, and product/project managers, to build, design, and shape the cloud infrastructure. Your role will also include improving and developing new...


  • مصر, Egypt Givaudan Full time

    Join us and celebrate the beauty of human experience. Create for happier, healthier lives, with love for nature. Together, with kindness and humility, we deliver food innovations, craft inspired fragrances and develop beauty and wellbeing solutions that make people look and feel good. There’s much to learn and many to learn from, with more than 16,000...