Site Reliability Engineer
About Us

LoyaltyLion is a data-driven loyalty and engagement platform trusted by thousands of ecommerce brands worldwide. Merchants use LoyaltyLion when they want a loyalty program that is proven to increase customer engagement, retention and spend. Stores using LoyaltyLion typically generate at least $15 for every $1 they spend on the platform.

Today LoyaltyLion works with over 10,000 small and medium-sized retailers. Our mission is to help them succeed in the age of Amazon, where they may not be able to compete on price and logistics but can offer a better customer experience. An experience where customers feel valued, rather than just another number.

It’s been an incredible two years for LoyaltyLion. We closed $ early last year and another $12m this year, and we’ve grown from 40 employees to over 100. We’ve built out our Leadership team, recruiting a CTO, CFO and Director of Product amongst other senior hires, and we continue to scale quickly, achieving spots in both the Deloitte Fast 50 and the FT1000. This is just the beginning of our inflection point.

The Role

We are looking for a Site Reliability Engineer to join our team and support LoyaltyLion's growth. Working with our SRE Lead, you will be responsible for ensuring the reliability, availability, and performance of our platform's infrastructure and systems.
You'll also support our Data team in the provisioning and tuning of our Data platform, and our development teams in optimising their applications and CI/CD pipelines for peak performance and efficiency.

Please note this is a fully remote position, within the UTC-0 to UTC+2 time zones.

Some of the things you'll be doing

• Delivering clean, architecturally sound, maintainable and secure infrastructure
• Working closely with AWS infrastructure, particularly focusing on data services, to support database scalability and availability
• Working with LoyaltyLion engineering teams to support the infrastructure they need and the platforms on which their services run
• Conducting performance tuning to optimise database performance and enhance data processing efficiency
• Implementing observability systems for infrastructure and data to ensure reliability and availability, find areas for improvement, and proactively assess risks to the stability or security of our platform
• Maintaining new and existing infrastructure with code, by writing well-designed Terraform code to make the best use of our AWS infrastructure
• Documenting and driving the adoption of DevOps best practices across the wider engineering team
• Conducting proofs-of-concept on new and emerging technologies and evaluating their fit for LoyaltyLion
• Taking part in honest and transparent blame-free post-mortems on incidents we have, so we can learn from them and prevent them from happening again
• Automate and accelerate - reduce manual tasks and allow all LoyaltyLion engineers to concentrate on building exciting new features
• Build, measure, learn - implement the best observability tools to continually improve LoyaltyLion performance

What we’re looking for

• 4+ years of experience with AWS
• In-depth knowledge of defining infrastructure as code using Terraform
• Experience in agile development practices
• Observability and monitoring using Datadog and CloudWatch or similar systems
• Bonus points for experience with real-time, low-latency, high-frequency transaction-based systems
• Extra bonus points for experience with Redshift, Glue, Airflow, Athena or any other frameworks and tools for data engineering
• Ability to diagnose problems at any level (Client, HTTP/Network, Server, Database, OS)
• Ability to write clear, concise documentation

Our Stack

• AWS
• Docker
• ECS (Fargate)
• Datadog
• PagerDuty
• Postgres, Redshift
• Infra as code: Terraform, Ansible, Packer
• Scripting: Bash, Ruby, Python
• Buildkite

The Engineering team works on a fully remote basis. However, you do have the option of working from our shiny new HQ in Farringdon.

Interview Process

• TA Screen
• Technical Overview + Value-based interview
• Tech Session + Q&A with Engineering Team
• Meet the CTO

Benefits

• Flexible working
• International Remote working (up to 30 days in each holiday year)
• 25 days holiday + bank holidays + carry 5 days holiday over into the next holiday year
• All permanent employees get equity to recognise the valuable contribution you'll make to our growth
• Company days out and events, and team socials
• Home office budget
• Cycle scheme
• Employee Assistance Program
• Private medical insurance
• Competitive learning and development budget
• The opportunity to join at a major inflection point – ecommerce is booming and with it, the demand for loyalty software like LoyaltyLion
• MacBook, Magic Keyboard, and any other tech or equipment you need to do a great job