Senior Site Reliability Engineer

Pex (Headquarters: Gainesville, FL)

Location: On-site, Gainesville, FL   |   Full-Time   |   $130,000 - $170,000
Site Reliability Cloud Kubernetes AWS Python Back End Engineer

About Pex: As part of the Vobile Group, Pex develops innovative content recognition technology that protects intellectual property and enables monetization. Our solutions detect modified audio and other content variations that traditional tools can’t identify. We serve clients from large platforms and rights holders to independent creators, making us a key player in the digital content ecosystem.

About The Role: As a Senior Site Reliability Engineer at Pex, you’ll design, build, and maintain the highly available, scalable infrastructure that powers our content recognition systems. You’ll work with distributed systems, cloud platforms, and automation tools to ensure our services meet the demands of our global client base. This role requires deep technical expertise in infrastructure and a passion for building reliable systems that handle massive content processing workloads.

Key Responsibilities:

  • Design and implement scalable, fault-tolerant systems for content processing and recognition
  • Optimize infrastructure for performance, cost-efficiency, and reliability
  • Develop and maintain automated deployment and monitoring systems
  • Troubleshoot and resolve complex production issues across distributed systems
  • Collaborate with engineering teams to improve system architecture and implementation
  • Implement monitoring, alerting, and logging solutions to ensure system health
  • Optimize database performance and data processing pipelines
  • Stay current with industry trends in cloud computing, containerization, and infrastructure automation

Required Skills and Qualifications:

  • 5+ years of experience in site reliability engineering or systems administration
  • Proven track record of designing and maintaining scalable, high-performance systems
  • Expertise with cloud platforms (AWS, Azure, or GCP)
  • Strong knowledge of containerization (Docker, Kubernetes) and orchestration tools
  • Experience with infrastructure automation (Ansible, Terraform, or similar)
  • Deep understanding of distributed systems, networking, and database technologies
  • Proficiency in scripting languages (Python, Bash) and cloud-native tools
  • Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack)

Ideal Candidate: We’re seeking an experienced SRE with a passion for building robust infrastructure that scales with our growing client base. You should have a strong background in system design, automation, and troubleshooting, with a focus on creating reliable services. Experience with content processing or media systems is a plus. This role offers the chance to work on challenging infrastructure problems that directly impact our core technology and global operations.

Benefits: Competitive salary, unlimited PTO, comprehensive health benefits, and a supportive work environment.

Post Date: July 17, 2025