Site Reliability Engineer at Splice

We are redirecting you to the source. If you are not redirected in 3 seconds, please click here.

Site Reliability Engineer Splice. JOB TITLE: . Site Reliability Engineer. LOCATION: . Remote. THE ROLE:. As a Site Reliability Engineer joining our dedicated Site Reliability Engineering team, you would be joining a group of peers who are passionate about customer service, resilience, automation, scalability, capacity planning, and observability.. To help Splice to deliver on its mission we must better align our cloud costs to the business, scale efficiently to meet the demand of our customers, deliver a more evolvable architecture and provide clear levers to the business to make well-informed decisions. To succeed in this work we must carefully address foundational challenges around tight coupling and standardization across our systems. . If these are things you might be interested in helping us solve, please consider applying today!.  . WHAT YOU’LL DO:. Design, develop, and maintain self-service tools that streamline the development process, support continuous integration pipelines, improve observability and the provisioning of resilient and scalable cloud infrastructure.. Create and maintain clear and comprehensive documentation, tutorials, and guides, to help developers understand and use our tools effectively.. Seek and respond to feedback to gather insights, identify areas for improvement, and proactively address pain points.. Make pragmatic, informed trade-offs with a bias towards learning fast while maintaining long-term maintainability.. Write well-researched internal RFCs proposing solutions while considering the benefits and risks of alternative approaches.. Stay updated on the latest technologies, best practices, and industry trends in software development..  . JOB REQUIREMENTS:. Working experience with AWS and cloud platform fundamentals (ie VPC, going from zero to a best-practices nginx container service, etc).. Working experience with containers and container-related technologies (Docker, Kubernetes, AWS ECS, etc.). Demonstrated comprehension of configuration management and Infrastructure as Code tooling like Terraform, Ansible, etc.. Working experience with observability tooling such as Datadog, or similar.. Willingness to participate in a weekly 24/7 on-call rotation.. Working experience building reliable CI/CD workflows, tools and processes using platforms such as Github Actions, Azure DevOps, Jenkins, GitLab CI, CircleCI, Bitrise or similar frameworks.. Experience programming in languages such as Python, Bash, or Go. . Proficiency utilizing Unix/Linux systems and tools to manage and troubleshoot server environments.. Experience collaborating with engineers as internal customers, proactively addressing friction points to enhance the engineering experience, and learning from others' experiences along the way.. Proven ability to produce clear and concise technical documents.. Outstanding communication, organizational skills, and a customer-centric mindset..  . NICE TO HAVES:. Knowledge of networking best practices and concepts (OSI model, routing, peering, hub-and-spoke topologies, analytics, etc.). Knowledge of security best practices and procedures (Secrets Management, Threat Modeling, etc.). Working experience with GCP and/or Azure.  . The national pay range for this role is $129,500- $142,000. Individual compensation will be commensurate with the candidate's experience.