
AI Research Resident
Job Description
About Polymath
Polymath is an applied research lab focused on advancing long-horizon agent capabilities through reinforcement learning. We design and scale simulation environments where agents learn to operate safely and autonomously. We work with the world’s leading model labs to push the frontier of agent capabilities. Polymath is backed by Base10, Founders Future, Y Combinator, and other incredible investors & angels. We've raised an $8M seed, and are growing out the team.
About the role
We’re looking for talented researchers currently enrolled in MS / PhD programs to collaborate on a research project focused around frontier benchmarks and environments for long-horizon AI agents. This will require 1) identifying failure modes in frontier models, 2) developing rigorous benchmarks that evaluate how well frontier agents perform on complex, realistic tasks requiring long-horizon reasoning and tool use in dynamic environments, and 3) training autonomous agents that can reason, plan, and act over extended time horizons.
We can accommodate full-time or part-time engagements. Compensation will be $200k / year prorated to the number of hours committed. The goal of the residency is to culminate in a publication, and if there is a mutual fit, transition into a full-time role. If you’re interested in joining Polymath but are not currently a student, please apply to the Member of Technical Staff role.
You’ll be a good fit if you:
- Are currently pursuing an MS or PhD program in Computer Science or a related field
- Have experience with reinforcement learning, benchmarking frontier models, or model post-training
- Have experience with systems engineering and can write production-quality code
- Have a strong track record of publications
- Have high agency, move quickly, and enjoy working on open-ended research problems
Culture
- Polymath is a team of researchers, engineers, and operators focused on advancing the frontier of safe, superintelligent AI agents.
- We have a flat organizational structure. We believe that people do their best work when they’re self-motivated and driven by a desire to learn, contribute to the team’s goals, and advance scientific progress.
- We’re looking for folks who ship fast, set high standards for themselves, and are great team players.
Optimize Your Resume for This Job
Get a match score and see exactly which keywords you're missing
Job Details
- Category
- Research
- Employment Type
- Part Time
- Location
- San Francisco, CA, US / Remote (US) (Remote Available)
- Posted
- Apr 27, 2026, 07:40 PM
- Listed
- Apr 27, 2026, 07:40 PM
- Compensation
- $75 - $120 per hour
About Polymath
Part of the growing frontier tech ecosystem pushing the edges of what's possible.
More Roles at Polymath
Similar Research Roles



Found this role interesting?