































































About Polymath
Polymath is an applied research lab focused on advancing long-horizon agent capabilities through reinforcement learning. We design and scale simulation environments where agents learn to operate safely and autonomously. We work with the world’s leading model labs to push the frontier of agent capabilities. Polymath is backed by Base10, Founders Future, Y Combinator, and other incredible investors & angels. We've raised an $8M seed, and are growing out the team.
About the role
We’re looking for talented researchers currently enrolled in MS / PhD programs to collaborate on a research project focused around frontier benchmarks and environments for long-horizon AI agents. This will require 1) identifying failure modes in frontier models, 2) developing rigorous benchmarks that evaluate how well frontier agents perform on complex, realistic tasks requiring long-horizon reasoning and tool use in dynamic environments, and 3) training autonomous agents that can reason, plan, and act over extended time horizons.
We can accommodate full-time or part-time engagements. Compensation will be $200k / year prorated to the number of hours committed. The goal of the residency is to culminate in a publication, and if there is a mutual fit, transition into a full-time role. If you’re interested in joining Polymath but are not currently a student, please apply to the Member of Technical Staff role.
You’ll be a good fit if you:
Ready to apply? Let us help you stand out.
Culture
We’re heading towards a future where AI agents will be able to perform useful work over long horizons, with little or no human supervision. To increase the reliability, performance, and safety of autonomous agents, they must be trained in simulation environments that reflect the real world. Polymath builds simulated worlds for agents to practice and learn through experience.
We're a team of researchers and engineers from UC Berkeley, Hume AI, Plaid, and Amazon. We have years of experience post-training frontier models in industry, and building large scale data systems. Polymath is backed by Y Combinator.
Salary
$75 - $120
Location
Remote