Operations Lead Engineer (Remote - US) at Jobgether

We are redirecting you to the source. If you are not redirected in 3 seconds, please click here.

Operations Lead Engineer (Remote - US) at Jobgether. This position is posted by Jobgether on behalf of Paytronix. We are currently looking for an Operations Lead Engineer in United States.. This role leads the Network Operations Center (NOC) functions, ensuring the stability, performance, and resilience of critical infrastructure. Acting as a central point during incidents, outages, and escalations, you will guide engineers and analysts toward operational excellence. The position offers the opportunity to shape monitoring strategies, implement automation, and optimize incident management processes to reduce manual intervention. Working closely with IT operations, cloud engineers, software developers, and customer support teams, you will influence both the day-to-day operations and long-term technology roadmaps. The environment is collaborative, fast-paced, and impact-driven, with room for innovation and leadership.. . Accountabilities. . Lead the transformation of NOC functions, establishing clear ownership areas and automated standard operating procedures.. . Develop and implement tools, dashboards, and predictive monitoring systems to detect and address issues proactively.. . Create automated alerting and remediation processes to minimize human intervention in incident resolution.. . Partner with IT Compliance to ensure alignment with PCI, SOC, and other industry frameworks.. . Oversee post-incident reviews, root cause analyses, and structured problem management for recurring high-severity issues.. . Monitor NOC KPIs, produce monthly operational reports, and drive continuous improvement initiatives.. . Collaborate with cross-functional teams, including development, cloud operations, and customer support, to maintain high service availability.. . Participate in a 24/7 on-call rotation and maintain readiness for critical incident response.. . . 5+ years in a corporate IT environment, with at least 3 years in incident or problem management.. . Proven experience in ITIL disciplines, especially Incident, Problem, and Change management.. . Demonstrated ability to manage high-impact IT incidents in pressured environments.. . Experience supporting 24/7 SaaS platforms and collaborating across technical and business teams.. . Strong communication and presentation skills.. . Familiarity with automation, monitoring, and infrastructure-as-code tools (e.g., Terraform, Ansible, Jenkins, Kubernetes).. . Knowledge of cloud platforms such as AWS and Azure, including scalability, security, and performance optimization.. . Bonus: ITIL certification, project management skills, Agile/DevOps practices, and scripting experience (Bash, PowerShell, Python).. . Company Location: United States.