Inception builds diffusion-based LLMs that make AI dramatically faster, cheaper, and more efficient.
Inception creates the world’s fastest, most efficient AI models. Today’s autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception’s diffusion-based LLMs (dLLMs) generate answers in parallel. They are up to 10X faster and more efficient, while delivering best-in-class quality. Inception pioneered the application of diffusion to language, launching the world’s first commercially available dLLM, Mercury, in early 2025, and is currently deploying large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today’s image and video AI, and Inception making it the standard for LLMs as well.
Total raised
$56.0M
Last stage
Seed
Investors
Stefano Ermon
Stanford University professor who co-invented diffusion models.
Aditya Grover
UCLA professor.
Volodymyr Kuleshov
Cornell University professor.
No applications, no recruiter spam. Just the intro.
A few questions to make sure this role is the right shape for you. Two minutes.
I write the intro, send it to the founder, and handle the back-and-forth.
If they’re a yes, I book the chat. You show up — that’s the whole job-hunt.