OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are seeking a Research Engineer/Scientist to join the Future of Computing Research team, focusing on developing RLHF and post-training methods for personalized, multimodal AI systems.
Responsibilities:
- Develop RLHF and post-training methods for multimodal models
- Build reward models and preference-learning pipelines for adaptive, personalized model behavior
- Design datasets, rubrics, and evaluation frameworks that capture user preferences, contextual appropriateness, and long-term value in realistic tasks
- Run experiments on policy improvement using explicit feedback, implicit signals, and model-based grading
- Work on long-horizon evaluation problems, where model quality depends not just on a single response but on whether behavior improves outcomes over time
- Collaborate closely with safety researchers to ensure that adaptation and personalization remain aligned, interpretable, and bounded by clear constraints
- Prototype and iterate quickly on training recipes, reward formulations, data pipelines, and evaluation suites for product-relevant behaviors
- Help define how OpenAI measures success for personalized AI systems including trust, appropriateness, and long-term user benefit