
Sr DevOps Engineer at Curology. This position will be responsible for designing, implementing, and maintaining scalable cloud infrastructure to support Curology's mission of making effective skincare accessible to everyone. In this role, you will collaborate with engineering teams across the organization to build robust, secure, and highly available systems on AWS. This position will require a strong understanding of cloud architecture, containerization, and infrastructure automation.. To be successful in this position, candidates should have hands-on experience with AWS, EKS, Terraform, Datadog, Grafana, and AuroraDB (MySQL), with the ability to be flexible and adopt new tools and technologies as business needs evolve. In this role, you will work closely with development teams to implement DevOps best practices, drive automation initiatives, and ensure optimal system performance and reliability.. Essential Duties and Job Functions. . Design and implement scalable AWS cloud infrastructure using Infrastructure as Code principles. . Manage and optimize Kubernetes clusters on Amazon EKS for containerized applications. . Develop, maintain, and enhance Terraform modules and configurations for consistent infrastructure provisioning. . Implement comprehensive monitoring, alerting, and observability solutions using Datadog and Grafana. . Optimize database performance, reliability, and scaling for AuroraDB (MySQL) clusters. . Build and improve CI/CD pipelines to enable rapid, safe deployments across multiple environments. . Execute infrastructure projects from concept to completion with minimal guidance in a fast-paced startup environment. . Collaborate with development teams to implement DevOps best practices and improve system observability. . Maintain and support critical infrastructure components and integrations. . Participate in on-call rotation and incident response procedures. . Perform routine system maintenance, security updates, and capacity planning. . Drive automation initiatives to reduce manual operational overhead. . Execute ad-hoc infrastructure tasks as needed. . . 5+ years of hands-on experience in DevOps, Site Reliability Engineering, or cloud infrastructure. . Extensive understanding of AWS services and cloud architecture patterns. . Experience with Amazon EKS and Kubernetes cluster management in production environments. . Proficiency with Terraform for infrastructure automation and state management. . Experience with Datadog for application and infrastructure monitoring. . Hands-on experience with Grafana for data visualization and dashboard creation. . Strong background with MySQL databases, preferably Amazon Aurora. . Understanding of containerization technologies (Docker, Kubernetes). . Integration experience with CI/CD tools and deployment automation. . General understanding of best practices in security, networking, and compliance. . Knowledge of scripting languages (Python, Bash, etc.). . Proficiency with version control systems (Git) is a MUST. . Bachelor's degree in Computer Science, Engineering, or related field is preferred. . Ability to balance multiple concurrent projects and competing priorities, solve problems quickly, take initiative and work independently. . Attention to detail, strong analytical mindset and excellent communication and collaboration skills. . Ability to participate in on-call rotation as needed. . Company Location: United States.