Polymath is an applied research lab focused on advancing long-horizon agent capabilities through reinforcement learning. They are seeking a Member of Technical Staff - Research to help push the frontier of autonomous agents by working on core research problems and developing advanced simulation environments.
Responsibilities:
- Work on core research problems in long-horizon evaluation, agent post-training, and environment design
- Build benchmarks
- Shape environments
- Write production code
- Run rigorous experiments
- Develop an advanced environment simulation engine for training and evaluating autonomous AI agents
- Develop state-of-the-art benchmarks that challenge frontier models
- Post-train agents in complex simulation environments
- Publish research