We are planning to enlarge our cool team with a skilled and enthusiastic SITE RELIABITY ENGINEER. Our new colleague will be easily assimilated into the team and engaged in Digital area projects
We will do that with support of a highly motivated squad, working closely with all the members of the team - from Solution architects, Developers, QAs and DevOps and so on - while keeping a close collaboration with the business lines on the projects of course
We are currently seeking an experienced and highly skilled Site Reliability Engineer to join our dynamic team. In this role, you will be responsible for ensuring that our services—both our internally critical and our externally-visible systems—have reliability and uptime appropriate to users' needs and a fast rate of improvement while keeping an ever-watchful eye on capacity and performance
Hands-on experience with Kubernetes, OpenShift or EKS
Hands-on skills in Linux systems
Experience with web application servers and load balancers (any of: F5, Nginx, Apache), databases (any of: MySQL, PostgreSQL, Oracle)
Experience with Docker methodologies and delivery
Experience with monitoring, logging, and alerting technologies (ELK stack and Dynatrace preferred)
Hands-on experience with GIT
Knowledge of building and supporting CI/CD pipelines
Previous production support experience, with can do mind-set and attitude and hands-on mentality
Strong written and verbal communication in English
Ability to work remotely and manage your own time in a team
Interest in designing, analyzing, and troubleshooting large-scale distribution systems
Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
Ability to debug and optimize code and automate routine tasks
Profile
Develop and maintain software that improves the reliability, scalability, and efficiency of your services
Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
Hands-on experience with Kubernetes, OpenShift or EKS
Hands-on skills in Linux systems
Experience with web application servers and load balancers (any of: F5, Nginx, Apache), databases (any of: MySQL, PostgreSQL, Oracle)
Experience with Docker methodologies and delivery
Experience with monitoring, logging, and alerting technologies (ELK stack and Dynatrace preferred)
Hands-on experience with GIT
Knowledge of building and supporting CI/CD pipelines
Previous production support experience, with can do mind-set and attitude and hands-on mentality
Strong written and verbal communication in English
Ability to work remotely and manage your own time in a team
Interest in designing, analyzing, and troubleshooting large-scale distribution systems
Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
Ability to debug and optimize code and automate routine tasks