OpenAI is an AI research and deployment company dedicated to ensuring that artificial general intelligence benefits all of humanity. It is seeking exceptional researchers to join the Post-Training Frontiers team, which improves the final post-training of OpenAI’s agentic models in collaboration with partner teams.

Responsibilities:

Own end-to-end research and engineering projects that improve the final post-training of OpenAI’s agentic models
Decide, together with partner teams, which integrations are ready for inclusion in major model runs
Develop horizontal model improvements across factuality, instruction following, tool/function calling, multi-agent behavior, reasoning-effort calibration, and other broad capabilities
Build and improve training, evaluation, grading, and data infrastructure for large-scale RL/post-training runs
Create evals and diagnostics that help us understand whether a model is ready to ship
Improve the feedback loop from real product usage into post-training, including better ways to learn from implicit user feedback
Collaborate closely with Codex, API, ChatGPT, product, training, and other post-training teams to make frontier models more useful, reliable, and agentic

Researcher, Agentic Post-Training

About this role

Responsibilities: