Wafer's mission is to maximize intelligence per watt, by building AI that optimizes AI itself. Our journey starts with GPU kernels, but will expand into every corner of ML systems and AI infrastructure. We're a small team (4 people) backed by Fifty Years, Y Combinator, Jeff Dean, and Woj Zaremba (co-founder of OpenAI), and we're looking for engineers who want to work at the intersection of AI agents and systems programming.
You'll work directly with the founding team to build the systems that power our GPU optimization platform, from the agent framework that iterates on kernels, to the profiling infrastructure that connects to NCU and ROCprofiler, to the compiler tooling that analyzes PTX and SASS.
Build and improve our framework for GPU kernel optimization (multi-turn tool use, state management, reward signals)
Develop integrations with GPU profilers and compiler toolchains
Design the architecture for remote GPU execution across cloud GPUs
Work on trace analysis systems that help the agent diagnose performance bottlenecks
Ship features that engineers use daily, and that optimizes infrastructure that runs the world's AI (PyTorch, vLLM, NVIDIA, AMD, etc.)
You're a strong fit if you:
Have deep technical intuition and can learn new domains quickly
Are comfortable working across the stack
Can ship production code fast while maintaining quality
Want to work on some of the most interesting AI infra problems at a small company with no bullshit + ship fast culture.
Very nice to have:
GPU programming experience (CUDA, HIP, Triton)
Experience with profiling tools or compiler internals
Background in AI/ML research or agent systems
Publications or open-source work in relevant areas
Wafer builds AI agents that work as autonomous performance engineers, optimizing GPU kernels for AI inference. Our customers are chip companies and cloud providers who need their AI models running at peak performance on any type of hardware. Our founding team includes engineers from Google (Spanner, Gemini), Two Sigma, AWS, and Argonne National Lab, with NeurIPS publications in ML.
Salary
$150,000 - $250,000
Equity
1% - 2%
Location
San Francisco, CA, US
Last stage
Seed
Investors
No applications, no recruiter spam. Just the intro.
A few questions to make sure this role is the right shape for you. Two minutes.
I write the intro, send it to the founder, and handle the back-and-forth.
If they’re a yes, I book the chat. You show up — that’s the whole job-hunt.