OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are seeking exceptional researchers to join the Post-Training Frontiers team, responsible for enhancing agentic models through impactful improvements and collaborations.
Responsibilities:
- Own end-to-end research and engineering projects that improve the final post-training of OpenAI’s agentic models
- Decide, together with partner teams, which integrations are ready for inclusion in major model runs
- Develop horizontal model improvements across factuality, instruction following, tool/function calling, multi-agent behavior, reasoning-effort calibration, and other broad capabilities
- Build and improve training, evaluation, grading, and data infrastructure for large-scale RL/post-training runs
- Create evals and diagnostics that help us understand whether a model is ready to ship
- Improve the feedback loop from real product usage into post-training, including better ways to learn from implicit user feedback
- Collaborate closely with Codex, API, ChatGPT, product, training, and other post-training teams to make frontier models more useful, reliable, and agentic