Site Reliability Engineer - Tribus
Job Description
A global quantitative trading firm is looking to hire a skilled Site Reliability Engineer to help build and support the next generation of its low-latency trading systems. Leading compensation packages on offer: $400k to $800k.
This is a hands-on, high-impact role that sits at the intersection of software engineering and infrastructure, where your work directly supports real-time trading performance across global markets.
Key Responsibilities
- Design, build, and manage scalable, resilient infrastructure to support trading and research systems.
- Develop tooling for observability, CI/CD pipelines, and automation.
- Collaborate closely with software engineers, researchers, and traders to ensure performance and reliability.
- Implement best practices in monitoring, alerting, capacity planning, and incident response.
- Proactively identify and resolve system issues to minimize downtime.
Requirements
- Strong experience in Linux systems administration and networking fundamentals.
- Proficiency in Python is a must.
- Experience with containerisation and orchestration tools (Docker, Kubernetes).
- Understanding of distributed systems, real-time performance, and low-latency requirements.
- Exposure to public cloud platforms (AWS, GCP, Azure) is a plus, but not essential.