Pex Job - Senior Site Reliability Engineer

About Pex: As part of the Vobile Group, Pex develops innovative content recognition technology that protects intellectual property and enables monetization. Our solutions detect modified audio and other content variations that traditional tools can’t identify. We serve clients from large platforms and rights holders to independent creators, making us a key player in the digital content ecosystem.

About The Role: As a Senior Site Reliability Engineer at Pex, you’ll design, build, and maintain the highly available, scalable infrastructure that powers our content recognition systems. You’ll work with distributed systems, cloud platforms, and automation tools to ensure our services meet the demands of our global client base. This role requires deep technical expertise in infrastructure and a passion for building reliable systems that handle massive content processing workloads.

Key Responsibilities:

Design and implement scalable, fault-tolerant systems for content processing and recognition
Optimize infrastructure for performance, cost-efficiency, and reliability
Develop and maintain automated deployment and monitoring systems
Troubleshoot and resolve complex production issues across distributed systems
Collaborate with engineering teams to improve system architecture and implementation
Implement monitoring, alerting, and logging solutions to ensure system health
Optimize database performance and data processing pipelines
Stay current with industry trends in cloud computing, containerization, and infrastructure automation

Required Skills and Qualifications:

5+ years of experience in site reliability engineering or systems administration
Proven track record of designing and maintaining scalable, high-performance systems
Expertise with cloud platforms (AWS, Azure, or GCP)
Strong knowledge of containerization (Docker, Kubernetes) and orchestration tools
Experience with infrastructure automation (Ansible, Terraform, or similar)
Deep understanding of distributed systems, networking, and database technologies
Proficiency in scripting languages (Python, Bash) and cloud-native tools
Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack)

Ideal Candidate: We’re seeking an experienced SRE with a passion for building robust infrastructure that scales with our growing client base. You should have a strong background in system design, automation, and troubleshooting, with a focus on creating reliable services. Experience with content processing or media systems is a plus. This role offers the chance to work on challenging infrastructure problems that directly impact our core technology and global operations.

Benefits: Competitive salary, unlimited PTO, comprehensive health benefits, and a supportive work environment.