Staff Site Reliability Engineer
Job Description
Almedia is the fastest-growing advertising company in Europe, according to the Financial Times. Based in the heart of Berlin, we offer mobile game and app developers unparalleled returns from rewarded user acquisition, engineering the future of UA with our data-driven approach and community of over 50 million users.We are at the forefront of a shift in the mobile app advertising landscape, changing the way people find and engage with apps. The industry is adapting around Almedia's approach, and we are building a team that can push us even further.Staff Site Reliability Engineer / DevOpsBerlin () or RemoteAbout youAn SRE or DevOps engineer with hands-on experience in high-traffic production systemsStrong in Linux, databases (MySQL, Postgres, MongoDB, Redis), and networking fundamentalsComfortable with Kubernetes, CI/CD pipelines, and observability tools like DatadogA self-starter who thrives in scaling environments and can work independently without PMsPragmatic, able to balance prevention, maintenance, and firefighting when neededYour mission is toTake ownership of uptime and reliability for a platform serving 50M+ usersBuild robust monitoring, alerting, and incident response practicesImprove CI/CD pipelines and enable safe deployments (blue-green, canary)Partner with engineers across teams to fix pain points in infra, tooling, and reliabilityBring initiatives that make the platform automatically reliable, cost-efficient, and scalableYour impactCollaborate with engineering teams to improve operational workflows and resilienceDesign smart alerts, improve observability, and drive better performance monitoringLead incident response, including on-call, and drive improvement with blameless postmortemsBuild safer delivery methods and improve deployments with Kubernetes and GitLab pipelinesReport directly to the CTO and act as the primary reliability leader in the companyYour toolkitLinux, networking (TCP/IP), and distributed systems troubleshootingDatabases: MySQL, Postgres, MongoDB, RedisKubernetes, GitLab pipelines, CI/CD best practicesObservability tools like Datadog, OpenTelemetry, or ELK stackNice-to-haves: RabbitMQ, Kafka, Terraform, Ansible, GCP, DatadogWhat makes this role excitingBe the first senior SRE hire with ownership of reliability across the entire platformShape infrastructure and processes for a scale-up growing beyond 100 FTEWork on a product serving millions of users worldwide with real engineering challengesGain autonomy while collaborating with strong product and engineering teamsJoin a culture that values pragmatism, initiative, and continuous improvementWhy Almedia?Own Our Growth: We offer all Berlin-based employees equity in Almedia to truly be a part of our success.Scale With Almedia: Grow alongside a startup that has been profitable from day one.Central Berlin Office: Work from a fully-stocked modern office built for collaboration, accessible from all around Berlin.Other Benefits: Transport subsidy, breakfasts and lunches, learning, Urban Sports Club, and more.We Listen: We regularly add to our benefits through rigorous employee feedback.We believe in fostering talent, evaluating all skill levels during the hiring process, and providing a clear path for growth.
Almedia is an equal opportunity employer. We embrace and celebrate , and encourage individuals from all backgrounds to apply.