Senior, Site Reliability Engineer at Mrsool

We are redirecting you to the source. If you are not redirected in 3 seconds, please click here.

Senior, Site Reliability Engineer at Mrsool. Who Are We❓. Welcome to the world of Mrsool! Where on-demand delivery meets unparalleled user needs to deliver anything you desire. As one of the largest delivery platforms in the Middle East and North Africa (MENA) region, Mrsool has captivated users with its unique and seamless experience, earning it the highest ratings among all major delivery platforms on both Apple's App Store and Google's Play Store. . What sets Mrsool apart is its commitment to providing an unmatched "order anything from anywhere" experience. This extraordinary feat is made possible by our extensive fleet of dedicated on-demand couriers. With their unwavering dedication, they ensure that your desired items reach your doorstep, no matter where you are. . Whether it's a late-night craving, a forgotten item, or a special gift for a loved one, Mrsool is here to deliver, quite literally. We take pride in the convenience we offer, empowering you to get what you need when you need it, all at the tap of a button. . The Job in a Nutshell💡. We are looking for a highly skilled . Senior Site Reliability Engineer. to ensure the reliability, scalability, and performance of our systems. The ideal candidate brings deep expertise in . AWS. , . Kubernetes. , and modern cloud infrastructure, along with strong problem-solving skills and a proactive approach to improving system resilience and automation.. If you're eager to take on this rewarding opportunity, we’d love to hear from you. Apply today!. What You Will Do💡. . Develop and maintain monitoring and alerting systems to proactively identify and address issues.. . Troubleshoot and escalate production incidents to minimize downtime and improve system reliability.. . Continuously improve our infrastructure and processes to optimize scalability and efficiency.. . Participate and take ownership for on-call rotations as needed to ensure 24/7 support for our application.. . Perform routine maintenance and upgrades as needed to keep our systems up to date.. . Contribute to ongoing efforts to improve our security posture and compliance with industry standards.. . Communicate complex technical concepts clearly and concisely to both technical and non-technical stakeholders in order to make the right decision.. . Mentor and coach junior engineers, fostering their professional growth and enabling them to deliver high-quality work.. . Stay up-to-date with the latest advancements and trends in site reliability engineering and share knowledge and insights with the team.. . Identify opportunities for organizational enhancements and propose alternatives to optimize team structures and execution.. . Collaborate with development teams to design and implement automated deployment and testing pipelines.. . Collaborate with development teams to design and implement scalable Infrastructure.. . What Are We Looking For❓. . Bachelor’s degree in Computer Engineering, Computer Science, or related field.. . 5+ years of experience in a similar role, preferably with experience in a high-traffic, high-availability environment.. . Proficiency in at least one programming language (Python, Ruby, Java, Go, etc.).. . Strong understanding of cloud infrastructure and related technologies (AWS, GCP, Azure, Kubernetes, Docker, etc.). . Excellent troubleshooting and problem-solving skills.. . Experience with one or more automation and configuration management tools (Chef, Ansible, Puppet, Terraform, etc.).. . Familiarity with monitoring and alerting tools (Prometheus, Grafana, Nagios, etc.). . Strong communication and interpersonal skills, enabling effective collaboration with cross-functional teams.. . Ability to navigate ambiguity, set clear expectations, and thrive in a fast-paced, dynamic environment.. . A strong grasp of computer science fundamentals when it comes to dealing with distributed systems and networks.. . Company Location: Egypt.