Senior Site Reliability Engineer at MLabs

Senior Site Reliability Engineer (Enterprise Platform)
Location: Remote - US. Open to Europe if happy to overlap with EST.
Compensation: Competitive.

We are a high-growth software company supporting the development of a premier open-source, EVM-compatible public ledger built for global enterprise and Web3 use cases. We are currently hiring a Senior Site Reliability Engineer for our "greenfield" enterprise-focused team. This team is building a private and consortium distributed ledger platform designed specifically for sectors with high security and privacy requirements, such as financial services, healthcare, and supply chain.

This is a hands-on, high-impact role where you will own the design, deployment, and reliability of mission-critical, multi-region infrastructure. This is not a traditional support role; we are looking for an engineer who has operated real systems at scale and is eager to take end-to-end ownership of architecture and operational standards from the ground up.

Key Responsibilities:
Systems Architecture: Design and operate highly available, multi-region distributed systems with rigorous recovery strategies (RTO/RPO).
Infrastructure as Code: Own large-scale IaC using Terraform, developing reusable modules and multi-account patterns with policy guardrails.
Kubernetes Orchestration: Scale production environments (EKS, GKE, or AKS) using GitOps (ArgoCD), Helm, and strict network policies.
CI/CD Leadership: Build secure pipelines supporting blue/green and canary deployments, artifact signing (SBOM), and automated rollback strategies.
SRE Advocacy: Define and improve SLOs, error budgets, and observability metrics to drive measurable reductions in MTTR.
Collaboration: Partner with the Head of SRE and VP of Engineering to translate complex business requirements into reliable, secure platform services.

Requirements:
7+ years of experience in SRE, Platform Engineering, or Infrastructure Engineering operating production distributed systems.
Multi-Cloud Mastery: Deep expertise in AWS or GCP, with experience running multi-region production environments and disaster recovery testing.
Containerization: Hands-on experience with Kubernetes at scale, including GitOps workflows and production-grade security controls.
Security Mindset: Strong background in Zero Trust principles, secrets management (Vault), and compliance frameworks (SOC 2, HIPAA, or NIST).
Tooling: Extensive experience with Terraform-first infrastructure in large-scale, real-world environments.

Nice to Have:
Experience with distributed ledger technology (DLT) or blockchain systems, particularly private/consortium deployments.
Familiarity with EVM-based systems and smart contract tooling (Solidity, Hardhat).
Experience operating active-active, globally distributed architectures.
Background in supporting financial services or other highly regulated industries.

Company Location: Netherlands.