Job Application for Research, Vision Expertise at Thinking Machines Lab Back to jobs Research, Vision Expertise San Francisco Apply Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals. We are scientists, engineers, and builders who’ve created some of the most widely used AI products, including ChatGPT and Character.ai, open-weights models like Mistral, as well as popular open source projects like PyTorch, OpenAI Gym, Fairseq, and Segment Anything.
About
the Role Thinking Machines builds multimodal-first. We’re looking for new team members to advance the science of visual perception and multimodal learning. We think about how vision and language interact at scale. We design architectures that fuse pixels and text, build datasets and evaluation methods that test real-world comprehension, and develop representations that let models ground abstract concepts in the physical world. Our goal is to create multimodal systems that support seamless integration into real-world environments. You’ll work at the intersection of visual understanding, multimodal reasoning, and large-scale model training. You’ll help develop the architectures, data, and evaluation tools that teach AI to see, understand, and collaborate. The best candidate is curious about multimodal interfaces, has experience running large scale experiments and is comfortable contributing to complex engineering systems. While we are looking for a person with expertise in multimodality, Thinking Machines Lab operates in a unified fashion and expects new hires to work across modalities as one team. This role blends fundamental research and practical engineering, as we do not distinguish between the two roles internally. You will be expected to write high-performance code and read technical reports. It’s an excellent fit for someone who enjoys both deep theoretical exploration and hands-on experimentation, and who wants to shape the foundations of how AI learns. Note: This is an "evergreen role" that we keep open on an on-going basis to express interest in this research area. We receive many applications, and there may not always be an immediate role that aligns perfectly with your experience and skills. Still, we encourage you to apply. We continuously review applications and reach out to applicants as new opportunities open. You are welcome to reapply if you get more experience, but please avoid applying more than once every 6 months. You may also find that we put up postings for singular roles for separate, project or team specific needs. In those cases, you're welcome to apply directly in addition to an evergreen role. What You’ll Do Own research projects on training and performance analysis of multimodal AI models. Curate and build large-scale datasets and evaluation benchmarks to advance vision capabilities. Work with our data infrastructure engineers, pretraining researchers and engineers, and product team to create frontier multimodal models and the products that leverage them. Publish and present research that moves the entire community forward. Share code, datasets, and insights that accelerate progress across industry and academia. Skills and
Qualifications Minimum
qualifications: Ability to design, run, and analyze experiments thoughtfully, with demonstrated research judgment and empirical rigor. Understanding of machine learning fundamentals, large-scale training, and distributed compute environments. Proficiency in Python and familiarity with at least one deep learning framework (e.g., PyTorch, TensorFlow, or JAX). Comfortable with debugging distributed training and writing code that scales. Bachelor’s degree or equivalent experience in Computer Science, Machine Learning, Physics, Mathematics, or a related discipline with strong theoretical and empirical grounding. Clarity in communication, an ability to explain complex technical concepts in writing. Preferred
qualifications — we encourage you to apply even if you don’t meet all preferred
qualifications, but at least some: Research or engineering contributions in visual reasoning, spatial understanding, or multimodal architecture design. Experience developing evaluation frameworks for multimodal tasks. Publications or open-source contributions in vision-language modeling, video understanding, or multimodal AI. A strong grasp of probability, statistics, and ML fundamentals. You can look at experimental data and distinguish between real effects, noise, and bugs. PhD in Computer Science, Machine Learning, Physics, Mathematics, or a related discipline with strong theoretical and empirical grounding; or, equivalent industry research experience. Logistics Location: This role is based in San Francisco, California.
Compensation: Depending on background, skills and experience, the expected annual salary range for this position is $350,000 - $475,000 USD. Visa sponsorship: We sponsor visas. While we can't guarantee success for every candidate or role, if you're the right fit, we're committed to working through the visa process together.
Benefits: Thinking Machines offers generous health, dental, and vision
benefits, unlimited PTO, paid parental leave, and relocation support as needed. As set forth in Thinking Machines' Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law. Create a Job Alert Interested in building your career at Thinking Machines Lab? Get future opportunities sent straight to your email. Create alert Apply for this job * indicates a required field Autofill with MyGreenhouse First Name * Last Name * Preferred First Name Email * Phone Country * Phone * Location (City) * Locate me Resume/CV * Attach Attach Dropbox Google Drive Enter manually Enter manually Accepted file types: pdf, doc, docx, txt, rtf Education School * Select... Degree * Select... Discipline * Select... Start date year End date year Add another LinkedIn Profile Link * Please provide the URL to your LinkedIn; if you don't have one, please write "none". Github Link * Please provide the URL to your Github; if you don't have one, please write "none". Personal Website About You * Please provide the URL to your personal website, Google Scholar, etc if you have one. Put "none" if you do not. Current Company * Please tell us the name of your current employer (today if you are employed). Put "none" if this does not apply to you; for example, if you are in school or not currently employed -- this does not disqualify you. Feel free to enter previous roles in the field below in "Past Company 1". Current Title or Role * Please enter your current title at your current employer. If you are not currently employed (or in school etc) please enter "none" and feel free to enter previous roles in the field below in "Past Company". Past Company 1 * Please enter the Company name of your most recent previous employer. If you have not worked at another company before your current one, please enter “none”. Past Company Title or Role * Please enter your title at your most recent previous employer. If you have not worked at another company before your current one, please enter “none”. Past Company 2 If you would like, please enter the Company name of your second previous employer. If you have not worked at another company before your current or previous one, please enter “none” or skip this question. Past Company Title or Role 2 If you would like, please enter the Title or Job of your second previous employer. If you have not worked at another company before your current or previous one, please enter “none” or skip this question. What domains of research do you have expertise in? * Pre-training Data Pre-training Science Pre-training Scaling Laws Architecture Optimization Multimodal Audio Perception Vision Robotics Post-Training Post-Training Data Human Data Synthetic Data Reinforcement Learning Reasoning Safety Alignment Other Select all that apply where you have actively completed work in and would be able to interview for in a technical interview. This will help us when picking between projects If you selected other, what areas did we not include that you have expertise in? * If you completed research under an advisor, such as through a PhD program or masters program where you published, who was your advisor? First name and last name of your Advisor / what program this was Do you have any links to publications we should read? Links to any publications, please list here (Optional) List 3 projects you're proud of. Please list 3 projects you're proud of, using 1 sentence each. Feel free to add a link if helpful. We will send this to the hiring team when reviewing your application. Will you now or in the future require sponsorship for employment visa status in the United States? * Select... (Optional) Other notes Voluntary Self-Identification For government reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file. As set forth in Thinking Machines Lab’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law. Gender Select... Are you Hispanic/Latino? Select... Race & Ethnicity Definitions If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. As a government contractor subject to the Vietnam Era Veterans Readjustment Assistance Act (VEVRAA), we request this information in order to measure the effectiveness of the outreach and positive recruitment efforts we undertake pursuant to VEVRAA. Classification of protected categories is as follows: A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to
compensation (or who but for the receipt of military retired pay would be entitled to
compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service-connected disability. A "recently separated veteran" means any veteran during the three-year period beginning on the date of such veteran's discharge or release from active duty in the U.S. military, ground, naval, or air service. An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense. An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985. Veteran Status Select... Voluntary Self-Identification of Disability Form CC-305 Page 1 of 1 OMB Control Number 1250-0005 Expires 04/30/2026 Why are you being asked to complete this form? We are a federal contractor or subcontractor. The law requires us to provide equal employment opportunity to qualified people with disabilities. We have a goal of having at least 7% of our workers as people with disabilities. The law says we must measure our progress towards this goal. To do this, we must ask applicants and employees if they have a disability or have ever had one. People can become disabled, so we need to ask this question at least every five years. Completing this form is voluntary, and we hope that you will choose to do so. Your answer is confidential. No one who makes hiring decisions will see it. Your decision to complete the form and your answer will not harm you in any way. If you want to learn more about the law or this form, visit the U.S. Department of Labor’s Office of Federal Contract Compliance Programs (OFCCP) website at www.dol.gov/ofccp . How do you know if you have a disability? A disability is a condition that substantially limits one or more of your “major life activities.” If you have or have ever had such a condition, you are a person with a disability. Disabilities include, but are not limited to: Alcohol or other substance use disorder (not currently using drugs illegally) Autoimmune disorder, for example, lupus, fibromyalgia, rheumatoid arthritis, HIV/AIDS Blind or low vision Cancer (past or present) Cardiovascular or heart disease Celiac disease Cerebral palsy Deaf or serious difficulty hearing Diabetes Disfigurement, for example, disfigurement caused by burns, wounds, accidents, or congenital disorders Epilepsy or other seizure disorder Gastrointestinal disorders, for example, Crohn's Disease, irritable bowel syndrome Intellectual or developmental disability Mental health conditions, for example, depression, bipolar disorder, anxiety disorder, schizophrenia, PTSD Missing limbs or partially missing limbs Mobility impairment, benefiting from the use of a wheelchair, scooter, walker, leg brace(s) and/or other supports Nervous system condition, for example, migraine headaches, Parkinson’s disease, multiple sclerosis (MS) Neurodivergence, for example, attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder, dyslexia, dyspraxia, other learning disabilities Partial or complete paralysis (any cause) Pulmonary or respiratory conditions, for example, tuberculosis, asthma, emphysema Short stature (dwarfism) Traumatic brain injury Disability Status Select... PUBLIC BURDEN STATEMENT: According to the Paperwork Reduction Act of 1995 no persons are required to respond to a collection of information unless such collection displays a valid OMB control number. This survey should take about 5 minutes to complete. Submit application Powered by Greenhouse
Salary
$350,000 - $475,000
Location
San Francisco
Total raised
$2.0B
Last stage
Seed
Investors
Mira Murati
Founder and CEO
John Schulman
Cofounder
Barret Zoph
VP of Research
No applications, no recruiter spam. Just the intro.
A few questions to make sure this role is the right shape for you. Two minutes.
I write the intro, send it to the founder, and handle the back-and-forth.
If they’re a yes, I book the chat. You show up — that’s the whole job-hunt.