
LB - Cloud Infrastructure Engineer - 0040 at Thaloz. Job Summary. We are seeking a highly skilled and experienced Senior Cloud Infrastructure Engineer to join our dynamic team. This role is critical to designing, implementing, and managing scalable, reliable, and secure cloud infrastructure on Amazon Web Services (AWS). The ideal candidate will play a pivotal role in enabling our organization to leverage cloud technologies effectively, ensuring robust infrastructure that supports our business goals. This position requires a deep understanding of cloud architecture, infrastructure as code, monitoring, security, and continuous integration/continuous deployment (CI/CD) pipelines. The successful candidate will collaborate closely with cross-functional teams to deliver innovative solutions that drive operational excellence and business growth.. Job Responsibilities. . Designing, deploying, and maintaining scalable and secure cloud infrastructure on AWS that meets the needs of our business and technical teams.. . Developing and managing infrastructure as code (IaC) using Terraform and AWS CloudFormation to automate provisioning and configuration of cloud resources.. . Implementing comprehensive monitoring and alerting solutions using AWS CloudWatch, Prometheus, and Grafana to ensure system health, performance, and availability.. . Managing logging and auditing systems with AWS CloudTrail and the ELK (Elasticsearch, Logstash, Kibana) stack to provide visibility and traceability of infrastructure events.. . Collaborating with software engineering, security, and operations teams to define infrastructure requirements, architect solutions, and implement best practices.. . Building and maintaining CI/CD pipelines using GitHub Actions, Jenkins, or GitLab CI/CD to automate application deployment and infrastructure changes.. . Ensuring adherence to security best practices and compliance standards, including identity and access management, encryption, network security, and regulatory requirements.. . Providing technical leadership and mentorship to junior engineers and other team members.. . Participating in on-call rotations to promptly respond to and resolve infrastructure incidents, minimizing downtime and impact on business operations.. . Continuously researching and adopting new cloud technologies and methodologies to improve infrastructure efficiency, reliability, and security.. . . Extensive experience designing, deploying, and managing cloud infrastructure on AWS, including core services such as EC2, S3, VPC, IAM, RDS, and Lambda.. . Proficient in writing, testing, and maintaining infrastructure as code using Terraform to automate cloud resource provisioning and management.. . Skilled in using AWS CloudFormation templates for infrastructure automation and orchestration.. . Expertise in setting up monitoring, logging, and alerting using AWS CloudWatch to track system metrics and respond to operational issues.. . Experience implementing Prometheus for monitoring containerized and microservices environments, including custom metrics collection.. . Ability to create and maintain Grafana dashboards for visualizing metrics and logs to support operational decision-making.. . Familiarity with Datadog for cloud infrastructure monitoring, log management, and alerting.. . . AWS CloudTrail:. Knowledge of AWS CloudTrail for auditing and tracking API activity to ensure security and compliance.. . . ELK Stack (Elasticsearch, Logstash, Kibana):. Experience managing centralized logging solutions using the ELK stack to aggregate, analyze, and visualize logs.. . . GitHub Actions, Jenkins, GitLab CI/CD:. Hands-on experience designing and maintaining CI/CD pipelines using one or more of these tools to automate build, test, and deployment workflows.. . . Networking:. Strong understanding of cloud networking concepts including VPC design, subnets, routing, VPN, security groups, and load balancing.. . . Security Best Practices:. Deep knowledge of cloud security principles such as least privilege access, encryption, key management, vulnerability management, and incident response.. . . Compliance:. Familiarity with compliance frameworks and standards relevant to cloud infrastructure, ensuring infrastructure meets regulatory and organizational policies.. . . Problem-Solving:. Excellent analytical and troubleshooting skills to diagnose and resolve complex infrastructure issues.. . . Communication:. Strong verbal and written communication skills to effectively collaborate with technical and non-technical stakeholders.. . . Collaboration:. Proven ability to work cross-functionally with engineering, security, and operations teams to deliver integrated solutions.. . . On-Call Participation:. Willingness to participate in on-call rotations to provide timely response to infrastructure incidents.. . . Nice to have. . . Startup Experience:. Experience working in fast-paced startup environments, demonstrating agility and adaptability.. . . Insurance Domain Knowledge:. Understanding of the insurance industry and its specific infrastructure and compliance requirements.. . . Bachelor's or Master's Degree in Computer Science or Engineering:. Formal education background in relevant technical fields.. . . AWS Certified Solutions Architect:. Professional certification validating expertise in designing AWS solutions.. . . AWS Certified DevOps Engineer:. Certification demonstrating skills in AWS DevOps practices and automation.. . . Docker:. Experience containerizing applications using Docker to improve deployment consistency and scalability.. . . Kubernetes:. Knowledge of Kubernetes for container orchestration and management in cloud environments.. . . Serverless Computing:. Familiarity with serverless architectures and services such as AWS Lambda and API Gateway.. . . AWS Lambda:. Hands-on experience developing and deploying serverless functions using AWS Lambda.. . . API Gateway:. Understanding of API Gateway for creating, publishing, and securing APIs in AWS.. . Company Location: Colombia.