Today, when you go to your doctor and get referred to a specialist, your doctor sends out a referral and tells you, “They’ll be in touch soon.” So you wait. And wait. Sometimes days, weeks, or even months. Why? Because too often providers are overwhelmed with the painstakingly tedious work required to get paid by insurance companies. Powered by proprietary models, Tennr handles the complex paperwork that gets patients through the door and providers paid, helping operators get patients the right care, at the right time, in the right setting.
As the first ML Ops Engineer at Tennr, you’ll play a crucial role in building and iterating on foundational Machine Learning and AI systems. You’ll own building machine learning training and inference pipelines that can handle increasing traffic demands and proliferation of product surface as we grow. You will be critical in ensuring our AI-driven healthcare platform is powered by robust, scalable, and efficiently deployed models.
Our Machine Learning team owns and develops multiple in-house, proprietary VLMs, LLMs, and other models that are purpose built for the ambitious problems we are solving in the healthcare space. This is not a role where you are repackaging and wrapping old innovations, but an opportunity to be on the cutting edge of experimentation and productization of net new capabilities. You’ll make impactful contributions and influence fundamental elements of our ML and data systems, expanding Tennr’s ability to rapidly iterate and solve critical problems for patients and providers.
5+ years of experience in ML model deployment, infrastructure, and scaling in production environments
Strong software engineering fundamentals, with proficiency in Python and TypeScript
Experience in software design and architecture for highly available ML systems for use cases like inference, evaluation, and experimentation
Strong knowledge of observability, including logging, metrics, tracing, model performance monitoring, and alerting
Experience with distributed systems, reliability, and production incident response
Comfortable working in ambiguity with high ownership, moving quickly in a fast-paced startup environment, and proactively driving projects from idea to production
Nice to have:
Experience working with ML CI/CD and common ML frameworks like Pytorch, Tensorflow, etc.
Experience working with common inference frameworks like vLLM, TensorRT, Triton, etc
Experience with GPU orchestration, including managing GPU workloads/scheduling, cost management, cluster utilization, etc
Experience with GPU optimization (training/inference) involving CUDA profiling, memory optimization, multi-GPU communication, etc
Fast, Transparent Patient Experiences
Salary
$200,000 - $220,000
Location
New York, NY
Experience
6+ years
Total raised
$162.0M
Last stage
Growth
Investors
No applications, no recruiter spam. Just the intro.
A few questions to make sure this role is the right shape for you. Two minutes.
I write the intro, send it to the founder, and handle the back-and-forth.
If they’re a yes, I book the chat. You show up — that’s the whole job-hunt.