HUD (YC W25) is developing agentic evals and RL environments for Computer Use Agents (CUAs) that browse the web for frontier AI labs. Our CUA Evals framework is the first comprehensive evaluation tool for CUAs.
People don't actually know if AI agents are working reliably. To make AI agents work in the real world, we need detailed evals for a huge range of tasks.
We're backed by Y Combinator, and work closely with frontier AI labs to provide agent evaluation and training infrastructure at scale.
Total raised
$21.0M
Last stage
Seed
Investors
Jay Ram
CEO @ HUD prev: consumer apps, ml research, quant research
May Walter
Serial entrepreneur and co-founder of Hud, focused on bridging AI-generated code with production-ready software.
Shai Wininger
CEO of Lemonade and serial entrepreneur co-founding Hud to provide function-level visibility into software behavior in production.
Roee Adler
Serial entrepreneur and co-founder of Hud, the Israeli startup developing a Runtime Code Sensor for real-time production visibility for engineers and AI agents.
No applications, no recruiter spam. Just the intro.
A few questions to make sure this role is the right shape for you. Two minutes.
I write the intro, send it to the founder, and handle the back-and-forth.
Lorenss Martinsons
CPO @ hud interested in natural intelligence, flowers, and the human condition
LinkedInIf they’re a yes, I book the chat. You show up — that’s the whole job-hunt.