    Staff Engineer - Site Reliability Engineering (SRE)

    6 299 - 9 449 USD gross/month - Permanent
    Type of work: Full-time
    Experience: Senior
    Employment type: Permanent
    Operating mode: Hybrid

    Tech stack

    • DevOps (advanced)
    • Azure (advanced)
    • Python (regular)
    • Go (regular)
    • Java (regular)
    • CI/CD (regular)
    • SaaS (regular)

    Job description

    Job Overview

    The Staff Engineer - SRE will be the technical leader of the global Site Reliability Engineering (SRE) team, driving the vision, strategy, and execution plan for the function. This role is critical in defining and implementing best practices for system reliability, scalability, and performance across the technical organization.

    As a key member of the engineering leadership team, the Staff Engineer will work closely with Infrastructure, Engineering, and Product teams to develop highly resilient, observable, and automated solutions that enhance system availability and efficiency. The ideal candidate will bring deep technical expertise, strong problem-solving skills, and a passion for reliability engineering.


    Job Responsibilities 

    • Participate in defining and leading the SRE vision and strategy, ensuring alignment with business objectives and engineering priorities.
    • Architect, implement, and advocate for best-in-class reliability, observability, and scalability practices across the platform. 
    • Develop automated solutions for system reliability, capacity planning, and incident response to minimize manual intervention. 
    • Partner with engineering and product teams to design and implement highly available and fault-tolerant systems in Azure Cloud. 
    • Participate in improving Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to enhance system reliability. 
    • Support root cause analysis (RCA) investigations, drive corrective actions, and advocate for a blameless postmortem culture. 
    • Influence and mentor engineering teams on SRE principles, DevOps culture, and best practices. 
    • Stay ahead of industry trends, adopting new tools, frameworks, and methodologies to continually improve system reliability. 


    Preferred Qualifications 

    • 8+ years of experience in software engineering, site reliability engineering, or cloud infrastructure roles. 
    • 5+ years of experience with DevOps tooling and practices, with prior experience as a Staff, Principal, or Distinguished Engineer. 
    • Proven expertise in designing and operating large-scale distributed systems in Azure Cloud. 
    • Proficient in designing and building service-oriented architectures and cloud-based distributed systems. 
    • Strong programming experience in languages such as Python, Go, Java, or C#/.NET.
    • In-depth technical understanding and experience with at least two of the following DevOps platforms: GitHub, Azure DevOps, GitLab, or Jenkins. 
    • Hands-on experience with observability tools (e.g., Prometheus, Grafana, OpenTelemetry, Datadog, or New Relic). 
    • Strong background in CI/CD pipelines, automation, and DevOps practices. 
    • Experience working in global, high-availability SaaS environments. 
    • Proficient in conducting and communicating evaluation and selection processes. 
    • Experience implementing redundancy and disaster recovery scenarios. 
    • Excellent teamwork and cross-group collaboration skills. 
    • Ability to collaborate with both technical and business professionals. 
    • Hands-on experience with Agile project development methodologies.
    • Experience delivering complex technical solutions. 
    • Excellent problem-solving, analytical, and communication skills. 
    • Previous experience in leading or mentoring engineers in a reliability-focused capacity. 


    Competencies and Skills 

    • Technical Leadership – Ability to set technical direction and drive cross-functional collaboration. 
    • Systems Thinking – Strong grasp of distributed systems, networking, and cloud architectures. 
    • Automation-First Mindset – Commitment to reducing toil through scripting and automation. 
    • Reliability Engineering – Expertise in SLOs, SLIs, error budgets, and high-availability architectures. 
    • Incident Management & Postmortems – Experience in handling production incidents and driving continuous improvement. 
    • Observability & Monitoring – Deep understanding of logging, monitoring, and alerting best practices. 
    • Practical knowledge of data structures and modern data engines. 
    • Collaboration & Communication – Ability to work across teams, influence stakeholders, and advocate for reliability improvements. 
    • Mentorship & Coaching – Passion for mentoring engineers and building an SRE culture within the organization. 


    Relativity is committed to competitive, fair, and equitable compensation practices.
