Company Overview Deepgram is a foundational AI company building state of the art, production-ready AI models that streamline human-computer interaction and amplify productivity. By enabling seamless communication between humans and machines, we believe we can harness the untapped potential of AI and help pave the way for a more productive future. We passionately believe in the potential of audio data to transform lives, businesses, and interactions across the globe - which is why Deepgram is trusted by well-respected companies like NASA, Twilio, Auth0, and Spotify to push the boundaries of what is possible in voice technology!
The Opportunity At Deepgram, we spend every day tackling big, real-world challenges in voice. Our customers hire us to solve their hardest problems, taking real, complex audio and transforming it into novel insights. And to raise the bar, everything we build needs scale in its DNA. We aren’t content with simple horizontal scaling: we intend to replace entire data centers dedicated to speech analytics with a single rack of servers. These challenges demand creativity and innovative problem-solving every day.
As a Research Scientist at Deepgram, you’ll have the freedom to explore and uncover breakthroughs. You’ll also have a mandate to build -- applying the latest advancements in deep learning to develop accurate and performant voice AI models. You will collaborate with product & engineering to help deploy these models in the most scalable speech API on the planet. We look forward to you bringing your whole self to work, sharing learnings from your latest experiments, and collaborating with us to advance the state of AI and voice technology.
The Role Deepgram is currently looking for an experienced Research Scientist who has worked extensively on building models to solve hard problems in voice AI domains including automatic speech recognition (ASR), text-to-speech (TTS), diarization and speaker identification, language detection, or code switching. Voice AI is a challenging problem space which involves dealing with raw audio waveforms generated by the human voice. The complexity of audio data poses unique infrastructure, engineering, and modeling challenges which are orders of magnitude more difficult than working with text. You should have extensive experience working on the hard technical aspects around deep learning for audio such as speech data curation and characterization, development of expressive and efficient neural network architectures for speech, distributed training at large-scales, and optimization of speech models for inference at scale.
What You’ll Do
You’ll Love This Role If You
It’s Important To Us That You Have
It Would Be Great if You Had
Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding after closing our Series B funding round last year. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!
Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.
We are happy to provide accommodations for applicants who need them.
Deepgram is a foundational AI company on a mission to understand human language. We give any developer access to the most advanced speech AI transcription and understanding with just an API call.
Our models deliver the fastest, most accurate transcription alongside contextual features like summarization, sentiment analysis, and topic detection. Beyond that, developers can: 🔊 Process live-streaming or pre-recorded audio 🌎 Transcribe in dozens of languages ⚙️ Train custom models for unique use cases 🔑 Access deep NLU with a unified API 💻 Build in any programming language with our SDKs ✅ Deploy on-prem or on DG’s managed cloud 📈 Get scalable GPU infra for training and inference
Deepgram is a proud NVIDIA partner and Y Combinator company, and we recently completed a $72M Series B to define the future of AI Speech Understanding, making us the most-funded speech AI company at its stage.
Salary
$150,000 - $250,000
Location
Remote
Experience
3+ years
Total raised
$202
Last stage
Series B
Investors
Adam Sypniewski
Founder
Noah Shutty
Founder
Scott Stephenson
No applications, no recruiter spam. Just the intro.
A few questions to make sure this role is the right shape for you. Two minutes.
I write the intro, send it to the founder, and handle the back-and-forth.
If they’re a yes, I book the chat. You show up — that’s the whole job-hunt.