Site Reliability Engineer - Database and Observability

Exoscale (Headquarters: Lausanne, Switzerland)

Location: Lausanne, Switzerland or remote in Europe   |   Full-Time
Cloud SRE Database Observability Monitoring MariaDB Cassandra FoundationDB Kafka Prometheus Go Distributed Systems Linux Automation Scalability Reliability Infrastructure Configuration Management Back End Engineer Data Engineer Staff Engineer
Exoscale is the leading Swiss/European cloud service provider. Join a dynamic working environment with a cutting-edge distributed team. Exoscale strives to create an environment with great working conditions and welcomes diverse applicants.

As part of its ongoing efforts to grow its infrastructure footprint Exoscale is hiring a Site Reliability Engineer (SRE). The SRE plays a critical role in ensuring constant availability of the Exoscale platform. This position focuses on designing, developing and maintaining Exoscale’s core platform databases and observability stack.

Some of the challenges you will be working on:
* Maintain and optimize our persistent data infrastructure, including MariaDB, Cassandra, FoundationDB, and Kafka.
* Enhance and evolve our observability stack to improve system visibility and performance monitoring.
* Take part in automation and orchestration efforts to streamline operations and reduce manual intervention.
* Improve processes to ensure scalability, reliability, and high availability of our infrastructure.
* Join the on-call rotation after completing a training period.

Ideal candidates are:
* Experienced with Linux and have a deep understanding of systems administration.
* Proficient in MariaDB and experienced in managing large-scale database deployments.
* Proficient in Go programming language and understands distributed systems principles
* Familiar with Prometheus and the broader observability ecosystem.
* Experienced (or is eager to learn) Kafka, Cassandra, and/or FoundationDB.
* Skilled in configuration management and managing large-scale infrastructure.
* Passionate about automation. Looking for ways to optimize workflows and reduce manual effort.
* Team players who thrive in a distributed team environment.
* Curious, autonomous, and eager to learn new technologies every day.
* Strong communicators in English, both written and spoken.

What we offer:
* Flexible working hours and working from home.
* Autonomous working conditions with a lot of freedom to create.
* Modern working atmosphere and centrally located office with great public transport connection
* Team events as well as training and further education.

Candidates who are not familiar with all the topics above but willing to learn are encouraged to apply.
Post Date: May 26, 2025