Senior Site Reliability Engineer
1 week ago
**Job Description**:
We’re looking for a **Senior SRE Engineer** to join Procore’s Fintech cloud infrastructure team. In this role, you’ll work collaboratively with software engineers, software testing engineers, and product/project managers, to build, design, and shape the cloud infrastructure. Your role will also include improving and developing new platform features that streamline our operations and development processes. You will build and maintain tools for self-service deployment, monitoring, and operations. You will also be responsible for troubleshooting and resolving issues in our Dev, test, and production environments across the full stack of hardware and software.
This position reports to the head of the cloud infrastructure and will be based in our Cairo Office**.** We’re looking for someone to join us immediately.
**What you’ll do**:
- Design, implement, and maintain modern microservices infrastructure, with automated continuous delivery (CI/CD) pipelines for various customer-facing and internal systems.
- Improve and add features to the cloud infrastructure platform through self-service automated tools that are used by engineering teams.
- Understand how systems fail and work with teams to reduce the risks.
- Automate provisioning of production, testing, and staging environments.
- Maintain the uptime and availability of production systems.
- Design, implement, and maintain monitoring strategies and tools for our important metrics.
- Design, implement, and maintain backup/restore strategies and tools.
**What we’re looking for**:
- Strong computer science/engineering background.
- Experience with automation/configuration management IaC tooling, especially Terraform and Helm Charts.
- Ability to use a wide variety of open-source technologies and cloud services, specifically AWS.
- Strong experience with RDBMS especially MySQL.
- NoSQL, Graph, and Key value data storage knowledge is a plus.
- Advanced level of coding and scripting in any of the following scripting languages: Python, Node.js, Go.
- Experience with CI/CD tools (Jenkins, Argo CD, etc.).
- Mastery of containerization technologies and orchestration runtimes, especially Kubernetes (AWS EKS)
- Experience with message brokers/queues is a huge plus (RMQ, Kafka, AWS Kinesis)
- Understand OS, networks, or hardware and can debug system issues and identify system bottlenecks.
- Advanced Linux/Bash/shell scripting knowledge.
- Good cloud-native security knowledge. Experience in AWS security is a huge plus.
Additional Information
If you'd like to stay in touch and be the first to hear about new roles at Procore, join our Talent Community.
**About Us**
Procore Technologies is building the software that builds the world. We provide cloud-based construction management software that helps clients more efficiently build skyscrapers, hospitals, retail centers, airports, housing complexes, and more. At Procore, we have worked hard to create and maintain a culture where you can own your work and are encouraged and given resources to try new ideas. Check us out on Glassdoor to see what others are saying about working at Procore.
We are an equal opportunity employer and welcome builders of all backgrounds. We thrive in a diverse, dynamic, and inclusive environment. We do not tolerate discrimination against employees on the basis of age, color, disability, gender, gender identity or expression, marital status, national origin, political affiliation, race, religion, sexual orientation, veteran status, or any other classification protected by law.
-
Site Reliability Engineer
2 weeks ago
مصر, Egypt Envision Employment Solutions Full time**Ready and hungry for a new adventure? You are definitely in the right place! We at **Envision Employment Solutions** are always on the look for top talents around the globe and matching them with our partners' hiring needs, to help them build and scale! - Our partners offer awesome work environment, competitive salaries, full benefits, and many others...
-
Senior Site Reliability Engineer
2 days ago
مصر, Egypt Evolvice Full timeEvolvice is a German nearshore service provider with branches in Egypt and Ukraine. Founded in 2012, Evolvice has a strong technical background and business domain knowledge, combining software engineering and Agile methodology, leading its’ clients’ path to digital transformation. Headquartered in the heart of the automobile industry, Stuttgart...
-
Senior Site Reliability Engineer
1 week ago
مصر, Egypt Procore Technologies Full time**Job Description**: We’re looking for a **Senior Site Reliability Engineer** to join Procore’s Fintech cloud infrastructure team. In this role, you’ll work collaboratively with software engineers, software testing engineers, and product/project managers, to build, design, and shape the cloud infrastructure. Your role will also include improving and...
-
Reliability Engineer
1 week ago
مصر, Egypt PepsiCo Full time**Responsibilities**: - Key stakeholder in delivering PEMM results for the Maintenance Support Department. - Will have ownership of the Reliability section of the site maintenance improvement plan (MIAP) coming from every PeMM assessment. - Lead the site Asset Reliability program. Own and develop the site Major Incident Report (MIR), Analytical Problem...
-
Senior Site Reliability Engineer I
6 days ago
مصر, Egypt Careem Full timeCareem is building ‘the everything app’ for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...
-
Site Reliability/devops Engineer
1 week ago
مصر, Egypt Qoyod Full timeJob Summary As Site Reliability/DevOps Engineer, you will introduce processes, tools, and methodologies to balance needs throughout the software development life cycle, from coding and deployment to maintenance and updates. **Responsibilities**: - Focus on improving the scalability, robustness, and automation of our tools and processes, as well as...
-
Senior Site Reliability Engineer I
2 days ago
مصر, Egypt Careem Full timeCareem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...
-
Site Reliability Engineer Ii
2 weeks ago
مصر, Egypt Careem Full timeAt Careem we are led by a powerful purpose to simplify and improve lives in the Middle East, North Africa and Pakistan. We're pioneering the development of innovative services to aid the mobility of people, the mobility of things and the mobility of money. We're in the driving seat as we help to define how technology will shape progress in some of the...
-
Senior Site Reliability Engineer Ii
6 days ago
مصر, Egypt Careem Full timeEgypt Careem is building the Everything App for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million...
-
مصر, Egypt Souq.com for E-Commerce LLC Full time3+ years experience planning, scheduling and auditing maintenance activities either as a hands on engineer or as a maintenance planner. - Experience with CMMS software. - Experience in using the core functions of MS Excel. - Experience managing stores or spare parts inventories. - Ability to communicate (written & verbal) in English and the local language at...