Site Reliability Engineer (Remote - India or Pakistan) at Jobgether

Source: https://jobs.workable.com/view/3KjXfhua4J5quCej2UdkHZ/site-reliability-engineer-(remote---india-or-pakistan)-in-india-at-jobgether

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Site Reliability Engineer in India and Pakistan.

As a Site Reliability Engineer, you will ensure the stability, performance, and scalability of mission-critical systems in a fast-growing, remote-first environment. You will work closely with cross-functional teams, including platform, security, and product engineers, to design, automate, and optimize cloud infrastructure and operational processes. Your contributions will directly impact system uptime, developer velocity, and the seamless delivery of high-quality software to users. This role offers the opportunity to work with cutting-edge technologies, improve observability and reliability practices, and grow as a technical leader while solving complex infrastructure challenges.

Accountabilities

In this role, you will be responsible for:

- Maintaining uptime and reliability across critical systems, focusing on scalability, observability, and incident prevention.
- Designing and managing cloud infrastructure using Terraform, Kubernetes, and CI/CD pipelines.
- Automating operational tasks, monitoring, deployment, and disaster recovery processes.
- Supporting and improving on-call processes, including incident response, retrospectives, and tooling.
- Collaborating cross-functionally to implement best practices and deliver reliable software.
- Building monitoring dashboards, alerts, and documentation to provide visibility into system health.
- Contributing to infrastructure projects that enhance security, performance, and developer efficiency.

To excel in this role, you should bring:

- Hands-on experience operating cloud-based systems (AWS preferred).
- Proficiency with Kubernetes, Helm, and Docker.
- Familiarity with CI/CD tools and deployment pipelines.
- Strong understanding of observability tools such as Datadog, Grafana, or Prometheus.
- Solid scripting or programming skills (Node.js experience is a plus).
- Ability to troubleshoot complex issues quickly and communicate effectively.
- Knowledge of systems design, incident management, and reliability best practices.
- Comfort working in high-speed, high-scale environments.

Nice-to-Have:

- Experience with messaging systems such as RabbitMQ, Kafka, or NATS.
- Exposure to internal developer platforms or tooling.
- Prior experience in DevOps, platform, or infrastructure teams.
- Experience supporting sandbox, staging, or demo environments.

Company Location: India.