Join us at Cleric
We're building an autonomous AI SRE that helps software engineering teams reliably investigate production incidents. Our agent combines LLMs with tools to understand systems, reason through problems, and take corrective actions - even for issues it hasn't encountered before. Our mission is to let engineers focus on building products, not fighting fires.
We're a small team of AI and infrastructure veterans backed by leading AI investors. Cleric is already in production at high scale companies and saving engineers hundreds of hours in investigations.
About the role
You'll help us scale our AI agent across multiple customer environments. You'll design and implement the infrastructure for agent deployment, execution, evaluation, and learning - creating systems that let us run our agents efficiently and reliably.
You'll architect systems to process agent telemetry data, manage our simulation environments, and train our AI agent. This includes building deployment pipelines, scaling mechanisms for handling increased load, and expanding our integrations (observability, cloud provider APIs etc) to suit diverse customer environments.
Beyond the technical implementation, you'll also set practical standards in code review, CI and observability to keep quality high while we scale. You'll mentor other engineers, provide technical direction, and ensure we're making pragmatic architectural decisions that balance immediate needs with long-term scalability.
You'll have technical autonomy in designing and implementing these systems, working closely with our founding team to expand our platform capabilities while maintaining high engineering standards.
What you'll do :
You have :
Nice to have :
How we work :
Interview process (you'll meet most of the team via the process)
Intro Call
Software Engineering Session (1 hour)
System Design Session (90 mins)
Bar Raiser (60 mins)
#J-18808-Ljbffr
Staff Software Engineer • San Francisco, CA, United States