Apollo Research is focused on the risks associated with AI systems and is looking for Research Scientists and Research Engineers to develop a hard science around the 'Science of Scheming'. The role involves collaborating with AI developers, studying reinforcement learning dynamics, and developing novel evaluation techniques for AI cognition.
Responsibilities:
- Collaborate with leading AI developers
- Deeply study the RL dynamics that lead to the emergence of reward-seeking, evaluation awareness or misaligned preferences
- Design and train model organisms, and scale your insights to frontier systems
- Work towards 'Scaling laws of scheming'
- Build the empirical foundations to predict how scheming risks evolve as models scale in capability
- Develop novel and ambitious evaluation techniques that have a chance of scaling to highly evaluation aware models
- Deep dive into AI cognition
- Discover patterns in the reasoning processes of frontier AI systems that no one else has ever observed before