Skip to main content

Site-Reliability Engineer in Phoenix

Energy Jobline is the largest and fastest growing global Energy Job Board and Energy Hub. We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy and engineering jobs, and work with the leading energy companies worldwide.

We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.

Job DescriptionJob Description

Job Description:

•Min 3-5 years of Service reliability/operation experience running large scale, high performance applications in a hybrid environment (on-prem and cloud).

•Min 3-5 years of experience writing automation scripts and building dashboards for Application Performance management to manage Transaction journeys.

•2-4 years of Experience working with Programming such as Go, Python, Java, Rust etc.

•Working knowledge on with one or more databases-Oracle, PL/SQL, SQL Server, Redis, Clickhouse, postgres, Mongo or any time-series databases

•At least 2+ years of Experience transitioning platforms to the cloud and Containerization - GCP, AWS and Rancher (or Cloud Formation, Azure and OpenShift).

•Experience maintaining containerized app in GKE/RKE/AKE environments.

•Experience Implementing Cloud observability using OTEL to enable real-time monitoring, distributed tracing and incident resolution.

•Experience working with specific GraphQL Framework (Apollo, Prisma, Hasura etc...).

•Experience using knowledge of networking protocols such as TCP/IP, HTTP, DNS, Load balancing and service mesh to troubleshoot issues in high pressure situations.

•Proven experience managing Application availability, building creative solutions to manage repetitive activities, improve gating and detect for applications at every touchpoint for a 24 x 7 High availability platform exposed to critical clients and customers.

•Working knowledge of Monitoring tools - Splunk, App-dynamics, grafana/Prometheus and Dynatrace.

•Experience with tools like Rally, Confluence and other CI/CD extenders.

•Hands-on experience with implementing in-memory caching solutions. Experience on Redis DB is a plus.

•Excellent debugging skills across variety of integrated technical platforms on API gateway.

•Hands-on with GCS, Cloud SQL, PL?SQL and Spanner.

•Monitor and troubleshoot HashiCorp Vault environments, ensuring minimal downtime and rapid recovery from incidents.

•Working knowledge on Vertex Al, Gen Al and Bigquery.

If you are interested in applying for this job please press the Apply Button and follow the application process. Energy Jobline wishes you the very best of luck in your next career move.

Site-Reliability Engineer in Phoenix

Phoenix, AZ
Full time

Published on 11/16/2025

Share this job now