Head of Operations Engineering in Morgantown
Energy Jobline is the largest and fastest growing global Energy Job Board and Energy Hub. We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy and engineering jobs, and work with the leading energy companies worldwide.
We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.
Job DescriptionJob DescriptionAbout Company
Talent Quest Solution
About the Opportunity
Industry: Enterprise SaaS Cloud Infrastructure. Sector: Platform Engineering, Site Reliability DevOps for large-scale production systems. Primary title: Head of Engineering Operations.
We are seeking an on-site Head of Operations Engineering to lead platform reliability, automation, and operational excellence for a US-based engineering organization. This role owns the end-to-end production lifecycle—from cloud infrastructure and deployment pipelines to observability, incident response, and cost/scale optimization.
Role Responsibilities
- Reporting structure: Reports to the VP of Engineering and partners closely with the CTO, Product, Security, and Cloud Operations leadership. Responsible for presenting operational performance and roadmap progress to senior leadership and the executive team.
- Team size leadership: Own and grow the Operations Engineering function. Typical span includes 6–10 direct reports and an overall organization of ~20–30 engineers (including embedded SREs, platform engineers, and on-call responders). Hire, onboard, mentor, run performance reviews, and execute succession planning to build a high-performing organization.
- Infrastructure roadmap execution: Define and execute the infrastructure roadmap—implement Infrastructure as Code, automated provisioning, and repeatable deployment patterns using cloud- tooling. Set delivery milestones and measure roadmap progress against agreed timelines.
- CI/CD, orchestration releases: Design, own, and optimize CI/CD pipelines, container orchestration, and release processes to reduce lead time, increase deployment reliability, and lower change failure rates. Drive automation to increase deployment frequency and reduce manual intervention.
- Observability reliability: Establish service-level objectives (SLOs/SLIs), implement observability and alerting frameworks, and lead blameless postmortems and continuous improvement cycles. Track and report SLO compliance and error budget consumption on a regular cadence.
- Incident management on-call expectations: Lead incident management and on-call strategy. Define escalation policies, build and maintain runbooks, and operate as the senior escalation for Sev1/P1 incidents. Expected to be available for major incidents outside business hours as escalation (role-level participation is limited and focused on high-severity incidents); responsible for designing and managing 24x7 team rotations (typical individual on-call cadence ~1 week every 8–12 weeks). Drive reductions in mean time to detect (MTTD) and mean time to resolve (MTTR) through tooling, automation, and process changes.
- KPIs metrics: Own operational KPIs and reporting, including but not limited to: availability/SLA compliance, MTTD, MTTR, deployment frequency, change failure rate, infrastructure cost per service, automation coverage (% of provisioning/deployments automated), and toil hours reduced. Set numeric targets, review them monthly, and use them to prioritize work and demonstrate impact.
- Cross-functional partnership compliance: Partner with Product, Security, and Engineering leadership to embed operational requirements into roadmaps, enforce compliance standards, and manage cloud cost and governance strategies. Influence product design for operability and scalability.
Skills QualificationsMust-Have
- Kubernetes
- AWS
- Terraform
- Docker
- Infrastructure as Code
- CI/CD
- Prometheus
- Grafana
- Python
- Go
- PagerDuty
Benefits Culture Highlights
- On-site, US-based leadership role with visible impact on product reliability and customer experience.
- Opportunity to shape operational strategy, tooling, and engineering best practices across the company.
- Collaborative environment that values ownership, measurable outcomes, and continuous improvement.
Location: USA (On-site). This role is optimized for senior leaders with deep hands-on experience in cloud platform operations, observability, and automation who can translate technical strategy into measurable reliability and delivery gains.
How to Apply Hiring Process
Fill out the application to apply. Give us all of your information so we can get back with you as soon as we can.
If you are interested in applying for this job please press the Apply Button and follow the application process. Energy Jobline wishes you the very best of luck in your next career move.