Location: San Francisco   |   Full-Time
AI Vector Database Open Source Rust Go Python Typescript Production Engineering SRE DevOps Cloud Distributed Systems Observability Kubernetes AI Engineer Back End Engineer
Chroma is building the memory layer for AI applications using our open-source vector database. Our mission is to accelerate the useful and creative applications of AI, and we serve as the leading retrieval solution trusted by developers worldwide. We just launched a rewritten version of our product in Rust, and also work in Typescript, Next.js, Go, and Python. Our core technology includes a new open-source, serverless, and distributed database (Rust data plane, Go control plane).

We are seeking talented Production Engineers (related to SRE/DevOps) to join our team in San Francisco (ONSITE). This role focuses on ensuring the reliability, scalability, and performance of Chroma's production systems, both open-source deployments and our upcoming Chroma Cloud offering.

Responsibilities:
- Design, build, and maintain the production environment for Chroma's database and services.
- Develop automation for deployment, monitoring, alerting, and incident response.
- Enhance system observability and performance tuning.
- Work with core technologies including Rust, Go, Python, Typescript, and cloud infrastructure.
- Collaborate with infrastructure and product engineering teams to improve system resilience and operational efficiency.
- Ensure the smooth operation of our distributed and serverless architecture in production.

Ideal Candidate:
- Strong background in Production Engineering, Site Reliability Engineering (SRE), or DevOps.
- Experience managing production systems on cloud platforms (AWS, GCP, Azure).
- Proficiency in scripting (Python, Go) and infrastructure-as-code tools (Terraform, Pulumi).
- Experience with monitoring/observability tools (Prometheus, Grafana, Datadog) and container orchestration (Kubernetes).
- Curious, dedicated to craft, and motivated to ensure system stability and performance.
- Aligned with our operating principles: Ambitious, resilient, truth-seeking, attentive to detail.
- Thrives in a fast-paced, in-person team environment in San Francisco.
Post Date: April 17, 2025