Senior Software Engineer - Site Reliability (Federal Operations) at Jobgether

We are redirecting you to the source. If you are not redirected in 3 seconds, please click here.

Senior Software Engineer - Site Reliability (Federal Operations) at Jobgether. This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Software Engineer - Site Reliability (Federal Operations) in the United States.. We are seeking a highly skilled Senior Software Engineer with strong DevOps expertise to support and optimize restricted government cloud environments. This role combines software development, infrastructure automation, and site reliability engineering to ensure high performance, reliability, and compliance for federal users. The ideal candidate will lead improvements in CI/CD pipelines, container orchestration, and distributed systems while mentoring junior engineers. This position offers the opportunity to work on complex, large-scale systems, collaborating across teams to implement secure, efficient, and scalable solutions. Strong problem-solving abilities, leadership experience, and the ability to navigate ambiguous and regulated environments are essential. The role emphasizes operational excellence, continuous improvement, and technical innovation in a mission-critical federal context.. . Accountabilities. Automate and develop tools to reduce repetitive operational tasks and minimize toil.. Maintain and scale highly reliable software applications using DevOps best practices.. Build and enhance CI/CD pipelines for automated testing, builds, and deployments.. Optimize and maintain Kubernetes-based orchestration systems for performance and reliability.. Troubleshoot complex production issues across application, infrastructure, and distributed system layers.. Participate in on-call rotations and support incident response.. Mentor junior engineers in software development and operational best practices.. Collaborate with stakeholders and product teams to meet infrastructure and deployment requirements.. Ensure compliance with government cloud standards across applications and infrastructure.. . 10+ years of overall experience, including 6+ years in software development and 3+ years in DevOps practices.. 3+ years of experience with Kubernetes, Terraform, Python or Go, and AWS.. 4+ years of experience with distributed systems.. Proven ability to maintain 99.99% uptime in production environments.. Experience with Redis, Kafka/PubSub, and relational databases.. Strong collaboration and communication skills across cross-functional teams.. Ability to ramp up quickly and contribute in complex, large-scale environments.. Demonstrated leadership in incident management and operational reliability.. Experience in fast-paced or startup-like environments.. Nice-to-have: familiarity with FedRAMP compliance and government security requirements.. Nice-to-have: experience implementing secure CI/CD pipelines in restricted or regulated environments.. . Company Location: United States.