About the Role
We’re looking for an experienced Site Reliability Engineer (SRE) to help us scale our platform with reliability, observability, and operational excellence at the core. You’ll partner with engineers and data scientists to build, automate, and maintain the infrastructure that powers our core platform—including data pipelines, ML workloads, and real-time analytics systems.
This is a hands-on, high-impact role with visibility across the stack and the opportunity to shape the future of our infrastructure and operations.
Key Responsibilities
Must-Have Qualifications
Nice-to-Have
What You’ll Get
Alembic is solving one of AI's defining challenges: proving cause and effect in proprietary enterprise data at scale. While generic models converge in capability, we're building the systems that process private corporate data to answer questions ChatGPT cannot—connecting marketing exposure to revenue, operations to outcomes, strategy to results.
Our customers include Nvidia, Delta Air Lines, Mars, and leading Fortune 500 companies making multi-million dollar decisions with our platform. The technical demands were so extreme we literally melted GPUs building this, which is why we're now operating one of the world's fastest private supercomputers. Backed by $145M from Prysm Capital, Accenture, WndrCo, Silver Lake Waterman, and others, we're scaling the team redefining enterprise competitive advantage in the age of AI.
Salary
$170,000 - $250,000
Location
San Francisco, CA
Total raised
$171.0M
Last stage
Series B
Investors
No applications, no recruiter spam. Just the intro.
A few questions to make sure this role is the right shape for you. Two minutes.
I write the intro, send it to the founder, and handle the back-and-forth.
If they’re a yes, I book the chat. You show up — that’s the whole job-hunt.