Apollo Research is focused on the risks associated with AI systems and is looking for Research Scientists and Research Engineers to develop a hard science around the 'Science of Scheming'. The role involves collaborating with AI developers, studying reinforcement learning dynamics, and developing novel evaluation techniques for AI cognition.

Responsibilities:

Collaborate with leading AI developers
Deeply study the RL dynamics that lead to the emergence of reward-seeking, evaluation awareness or misaligned preferences
Design and train model organisms, and scale your insights to frontier systems
Work towards 'Scaling laws of scheming'
Build the empirical foundations to predict how scheming risks evolve as models scale in capability
Develop novel and ambitious evaluation techniques that have a chance of scaling to highly evaluation aware models
Deep dive into AI cognition
Discover patterns in the reasoning processes of frontier AI systems that no one else has ever observed before

Research Scientist/Engineer (Science of Scheming)

Key skills

About this role

Responsibilities: