Site Reliability Engineer (SRE)

Location: London / Warsaw / Paris / New York

Employment Type: Full-time

Department: Engineering / Platform / Infrastructure

About the role

We are looking for a driven and detail-oriented Site Reliability Engineer (SRE) to help design, build, and operate reliable, scalable, and highly available systems. This role bridges software engineering and infrastructure operations, with a strong focus on automation, observability, and continuous improvement.

You’ll work closely with development, platform, and operations teams to ensure our services are resilient, performant, and measurable.

Key responsibilities

  • Design, implement, and maintain reliable and scalable systems

  • Define and monitor SLIs, SLOs, and error budgets

  • Automate infrastructure provisioning, configuration, and operations

  • Improve system observability using logging, monitoring, and tracing tools

  • Respond to and resolve incidents, lead root cause analysis, and drive postmortems

  • Reduce toil through automation and process improvements

  • Support CI/CD pipelines and deployment strategies

  • Collaborate with engineering teams to improve system reliability and performance

  • Participate in on-call rotations and incident response

Required skills & experience

  • Experience in SRE, DevOps, or infrastructure engineering roles

  • Strong background in Linux system administration

  • Proficiency in at least one programming or scripting language (Python, Go, Bash, etc.)

  • Experience with cloud platforms (AWS, Azure, GCP)

  • Knowledge of containerisation and orchestration (Docker, Kubernetes)

  • Familiarity with monitoring and observability tools (Prometheus, Grafana, ELK, etc.)

  • Solid understanding of networking, scalability, and distributed systems

Desirable skills

  • Experience with Infrastructure as Code (Terraform, CloudFormation, ARM)

  • Knowledge of CI/CD tooling and GitOps practices

  • Experience operating high-availability or large-scale production systems

  • Understanding of security, reliability, and performance best practices

  • Relevant certifications (CKA, cloud provider certifications, etc.)

What we offer

  • Competitive salary and benefits package

  • Opportunity to work on large-scale, high-impact systems

  • Strong focus on automation, reliability, and engineering excellence

  • Support for learning, experimentation, and career growth

How to apply

Please email your CV and a brief cover letter outlining your experience and interest in the role.