Site Reliability Engineer
Toters
The Company
Toters is an on-demand e-commerce and delivery platform and operates a service that enables customers to get anything in their city at the highest level of convenience.
At Toters, technology is at the heart of everything we do. We have product teams that are working hard every day to create products that make our customers' lives easier. Our engineers are also continuously creating solutions to make our processes more efficient, all in an effort to get to our customers fast and at the best cost. If you are interested in working in a high growth startup environment, and look to be part of a team that will potentially change the way customers shop in the Middle East, apply now.
Role
The ideal candidate should thrive in fast-paced environments that require multi-functional thinking, collaboration, and innovation.
As a Site Reliability Engineer, you will be responsible for ensuring the reliability and availability of our software platforms. You will work closely with the development and infrastructure teams to ensure the deployment and maintenance of our software is smooth and efficient. Additionally, the Site Reliability Engineer will be responsible for implementing best practices and processes to ensure maximum uptime and performance of our platforms.
Description
A Site Reliability Engineer is a key role in toter’s infrastructure team as it ensures that our app is highly available and performant for our end-users. In this role you will:
- Collaborate with development and infrastructure team to ensure smooth deployment and maintenance of our software platforms
- Design, implement and maintain systems for monitoring, alerting, and logging
- Ensure that our platforms are scalable and reliable
- Implement best practices for disaster recovery and business continuity planning
- Perform root cause analysis on production issues and implement corrective actions
- Participate in on-call rotation to ensure rapid response to system outages and incidents
Key Qualifications
- Bachelor's degree in Computer Science or a related field
- 4+ years of experience in site reliability engineering, infrastructure engineering, or a related field
- Strong experience with cloud-based infrastructure AWS
- Strong experience with containerization and orchestration (Docker, Kubernetes)
- Solid understanding of networking, storage, and database technologies
- Experience with scripting languages (Bash or Ruby)
- Excellent problem-solving and analytical skills
- Strong communication and collaboration skills
- Ability to work in a fast-paced, dynamic environment
Nice to have
- Experience collaborating with product, engineering, marketing and operations teams
- Previous experience with food delivery applications
- Fluency in English and Arabic. French is a plus
- PMP certification would be beneficial, but not required
- Exceptional communication, leadership, and influence skills
- Strong partnership and cross-functional collaboration skills.