Site Reliability Engineer (Remote - Texas) at Jobgether

We are redirecting you to the source. If you are not redirected in 3 seconds, please click here.

Site Reliability Engineer (Remote - Texas) at Jobgether. This position is posted by Jobgether on behalf of Imubit. We are currently looking for a Site Reliability Engineer in Texas (USA).. This role offers an exciting opportunity to design, maintain, and optimize cloud infrastructure that supports high-stakes industrial AI applications. You will work closely with software developers, DevOps engineers, and other stakeholders to ensure systems remain reliable, scalable, and secure. The position combines hands-on infrastructure work with automation, monitoring, and incident response, providing a chance to solve complex problems in distributed systems. You will directly influence the performance and efficiency of cloud services, contribute to continuous improvement initiatives, and implement best practices for production-grade systems. This is an ideal role for someone passionate about cloud technologies, automation, and proactive system management.. Accountabilities. . Design, deploy, and maintain cloud infrastructure ensuring high uptime, scalability, and security.. . Optimize deployment processes and manage cross-cloud network infrastructure, including subnets, routing tables, VPNs, transit gateways, and firewall rules.. . Participate in incident management and on-call rotation, quickly identifying and resolving issues to minimize downtime.. . Automate repetitive tasks and improve system efficiency through Infrastructure-as-Code (IaC) and other automation tools.. . Monitor and analyze system performance, applying insights to enhance reliability and scalability.. . Collaborate with software developers and DevOps teams to implement robust solutions and continuous improvements.. . Stay current with industry trends, best practices, and emerging technologies to evolve system infrastructure.. . . 4+ years maintaining production-level cloud infrastructure, including public cloud platforms such as AWS or GCP.. . Bachelor’s degree in Computer Science or equivalent experience preferred.. . Proficiency in a programming language such as Python or Go.. . Experience deploying and supporting Kubernetes services and using GitOps tools like ArgoCD.. . Familiarity with software development principles and version control systems (e.g., Git).. . Experience with monitoring tools such as New Relic, Splunk, Grafana, or Prometheus.. . Experience managing production databases, including managed services like PostgreSQL/AWS RDS.. . Knowledge of Infrastructure-as-Code tools such as Terraform or Ansible.. . Experience with secrets management tools such as HashiCorp Vault or AWS Secrets Manager.. . Strong analytical, debugging, and problem-solving skills, with the ability to automate routine tasks and optimize systems.. . Excellent communication skills, ownership mindset, and proactive approach to system reliability.. . Company Location: United States.