About the Role

Wafer's mission is to maximize intelligence per watt, by building AI that optimizes AI itself. Our journey starts with GPU kernels, but will expand into every corner of ML systems and AI infrastructure. We're a small team (4 people) backed by Fifty Years, Y Combinator, Jeff Dean, and Woj Zaremba (co-founder of OpenAI), and we're looking for engineers who want to work at the intersection of AI agents and systems programming.

You'll work directly with the founding team to build the systems that power our GPU optimization platform, from the agent framework that iterates on kernels, to the profiling infrastructure that connects to NCU and ROCprofiler, to the compiler tooling that analyzes PTX and SASS.

What You'll Do

Build and improve our framework for GPU kernel optimization (multi-turn tool use, state management, reward signals)
Develop integrations with GPU profilers and compiler toolchains
Design the architecture for remote GPU execution across cloud GPUs
Work on trace analysis systems that help the agent diagnose performance bottlenecks
Ship features that engineers use daily, and that optimizes infrastructure that runs the world's AI (PyTorch, vLLM, NVIDIA, AMD, etc.)

What We Look For

You're a strong fit if you:

Have deep technical intuition and can learn new domains quickly
Are comfortable working across the stack
Can ship production code fast while maintaining quality
Want to work on some of the most interesting AI infra problems at a small company with no bullshit + ship fast culture.

Member of Technical Staff

About the role

About the Role

What You'll Do

What We Look For

About Wafer AI

Other roles at Wafer AI

Job details

Company

Funding

Founders

What happens next.

Confirm the fit

I pitch you to the company

A meeting lands on your calendar