Location: Remote   |   Contract
SRE, DevOps, Kubernetes, Monitoring, Python, Go, C++, Rust, JavaScript, Networking, CDN, Cloudflare, AWS, Azure, CI/CD, Distributed Systems, Security, Scalability, Performance, Availability, Back End Engineer, Staff Engineer
Intuition Machines uses AI/ML to build enterprise security products like hCaptcha, serving hundreds of millions of users daily. We operate with low overhead, small teams, and rapid iteration.

As a Site Reliability Engineer (SRE), you will focus on engineering solutions for performance, availability, security, and cost-effectiveness at internet scale (millions of requests/sec). You will work across infrastructure, data, and application layers to build robust solutions.

What you will do:
* Work with large-scale systems (millions of requests per second, millions of users, multi-cloud).
* Develop solutions to enhance performance, availability, security, and cost-effectiveness.
* Ensure system uptime, speed, and developer productivity.
* Improve quality, security, uptime, speed of delivery, threat detection, and customer engagement.
* Source improvement ideas from customers, the internal community, and metrics, and make decisions rapidly.
* Be creative and create value by improving the customer experience.

What we are looking for:
* Expert in Kubernetes.
* Expert in monitoring applications, infrastructure, and networks.
* Background in software engineering with backend expertise in Kubernetes-based systems.
* Strong programming skills in Python, JavaScript, Go, C++, or Rust.
* Strong understanding of networking, proxies, and CDNs (Cloudflare).
* Multi-cloud experience (virtual networking, load balancing, WAF).
* Strong CI/CD experience.
* Hands-on experience in high-scale, high-uptime, high-reliability environments.
* Minimum of 6 years of hands-on experience in related roles (engineering, DevOps, SRE).
* Familiarity with distributed systems (queue-first architectures, sharding).
* Demonstrated engineering expertise (requirements gathering, problem-solving, recommendations).

Preferred:
* Familiarity with security frameworks, attack vectors, botnets, and impact analysis.

What we offer:
* Fully remote position with flexible working hours.
* An inspiring team of colleagues spread all over the world.
* Pleasant, modern development and deployment workflows: ship early, ship often.
* High impact: lots of users, happy customers, high growth, and cutting-edge R&D.
* Flat organization with direct interaction with customer teams.
Post Date: April 21, 2025