Staff Cloud DevOps/Site Reliability Engineer (SRE) Inworld AI. Our Technical Operations team manages the infrastructure, DevOps, and Site Reliability of our platform. We are looking for a Staff Cloud DevOps/Site Reliability Engineer to join our team. . Qualifications. Bachelor's degree in Computer Science, Engineering, or a related field. 7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer (or as a software engineer with relevant experience).. At least 2 years experience each with:. Terraform. Helm. Kubernetes. AWS, Azure, or GCP. CI/CD using modern tools (GitOps). Optional (not required but considered a plus):. MLOps (building, orchestrating, and maintaining Machine Learning Pipelines). Prometheus / Grafana. Multi-cloud deployments (2 or more). ArgoCD. Network management and VPNs. Responsibilities. Infrastructure: Maintain and contribute to Infrastructure-as-Code (Terraform). DevOps and CI/CD Pipelines: Orchestrate pipelines using Github Actions, Helm, ArgoCD. Microservices scalability: Kubernetes Administration. Cloud Administration. Site Reliability: Measure and monitor availability, latency, and overall service health, drive incident management and post-mortem analysis. In-office location: Vancouver, Canada.. Remote location: Canada.. The Canada base salary range for this full-time position is CAD $170,000 - $220,000. In addition to base pay, total compensation includes bonus, equity and benefits. Within the range, individual pay is determined by work location and additional factors, including competencies and experience.. . Inworld Jobs Privacy
Staff Cloud DevOps/Site Reliability Engineer (SRE) at Inworld AI