#1 Job Board for tech industry in Europe

Top Companies

Geek

Staff Engineer - Site Reliability Engineering (SRE)

New

DevOps

Staff Engineer - Site Reliability Engineering (SRE)

Relativity

6 299 - 9 449 USDGross/month - Permanent

Type of work

Full-time

Experience

Senior

Employment Type

Permanent

Operating mode

Hybrid

Tech stack

DevOps

advanced

Azure

advanced

Python

regular

Go

regular

Java

regular

CI/CD

regular

SaaS

regular

Job description

Job Overview

The Staff Engineer - SRE will be the technical leader of the global Site Reliability Engineering (SRE) team, driving the vision, strategy, and execution plan for the function. This role is critical in defining and implementing best practices for system reliability, scalability, and performance across the technical organization.

As a key member of the engineering leadership team, the Staff Engineer will work closely with Infrastructure, Engineering, and Product teams to develop highly resilient, observable, and automated solutions that enhance system availability and efficiency. The ideal candidate will bring deep technical expertise, strong problem-solving skills, and a passion for reliability engineering.

Job Responsibilities

Participation in defining and leading the SRE vision and strategy, ensuring alignment with business objectives and engineering priorities.
Architect, implement, and advocate for best-in-class reliability, observability, and scalability practices across the platform.
Develop automated solutions for system reliability, capacity planning, and incident response to minimize manual intervention.
Partner with engineering and product teams to design and implement highly available and fault-tolerant systems in Azure Cloud.
Participate in improving Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to enhance system reliability.
Support root cause analysis (RCA) investigations, drive corrective actions, and advocate for a blameless postmortem culture.
Influence and mentor engineering teams on SRE principles, DevOps culture, and best practices.
Stay ahead of industry trends, adopting new tools, frameworks, and methodologies to continually improve system reliability.

Preferred Qualifications

8+ years of experience in software engineering, site reliability engineering, or cloud infrastructure roles.
5+ years of experience with DevOps tooling and practices, with prior experience as a Staff, Principal, or Distinguished Engineer.
Proven expertise in designing and operating large-scale distributed systems in Azure Cloud.
Proficient in designing and building service-oriented architectures and cloud-based distributed systems.
Strong programming experience in languages such as Python, Go, Java, or C# or .Net.
In-depth technical understanding and experience with at least two of the following DevOps platforms: GitHub, Azure DevOps, GitLab, or Jenkins.
Hands-on experience with observability tools (e.g., Prometheus, Grafana, OpenTelemetry, Datadog, or New Relic).
Strong background in CI/CD pipelines, automation, and DevOps practices.
Experience working in global, high-availability SaaS environments.
Proficient in conducting and communicating evaluation and selection processes.
Experience implementing redundancy and disaster recovery scenarios.
Excellent teamwork and cross-group collaboration skills.
Ability to collaborate with both technical and business professionals.
Hands-on experience with Agile Project Development Methodologies.
Experience delivering complex technical solutions.
Excellent problem-solving, analytical, and communication skills.
Previous experience in leading or mentoring engineers in a reliability-focused capacity.

Competencies and Skills

Technical Leadership – Ability to set technical direction and drive cross-functional collaboration.
Systems Thinking – Strong grasp of distributed systems, networking, and cloud architectures.
Automation-First Mindset – Commitment to reducing toil through scripting and automation.
Reliability Engineering – Expertise in SLOs, SLIs, error budgets, and high-availability architectures.
Incident Management & Postmortems – Experience in handling production incidents and driving continuous improvement.
Observability & Monitoring – Deep understanding of logging, monitoring, and alerting best practices.
Practical knowledge of data structures and modern data engines.
Collaboration & Communication – Ability to work across teams, influence stakeholders, and advocate for reliability improvements.
Mentorship & Coaching – Passion for mentoring engineers and building an SRE culture within the organization.

Relativity is committed to competitive, fair, and equitable compensation practices

Practice your English before your job interview!

Get 3 free English lessons

6 299 - 9 449 USD

Gross/month - Permanent

Check similar offers

Senior DevOps Engineer

New

Euvic S.A.

Undisclosed Salary

Gliwice

, Fully remote

Fully remote

IaC

Prometheus

Docker

Senior DevOps Engineer with AWS

New

Acaisoft

5.04K - 6.8K USD

Warszawa

, Fully remote

Fully remote

Kubernetes

AWS

CI/CD

(Senior) DevOps Engineer

New

Empik

Undisclosed Salary

Warszawa

, Fully remote

Fully remote

Jenkins

Prometheus

Terraform

DevOps Engineer

New

H2B Group

5.08K - 5.93K USD

Gdańsk

, Fully remote

Fully remote

DevOps

Red Hat

Kubernetes

Senior Architect (MS Dynamics 365)

New

Experis Manpower Group

8.47K - 9.31K USD

Warszawa

, Fully remote

Fully remote

CI/CD

Power Automate

MS Dynamics 365