About this role

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. They are seeking a Research Engineer/Scientist to join the Future of Computing Research team, focusing on developing RLHF and post-training methods for personalized, multimodal AI systems.

Responsibilities:

Develop RLHF and post-training methods for multimodal models
Build reward models and preference-learning pipelines for adaptive, personalized model behavior
Design datasets, rubrics, and evaluation frameworks that capture user preferences, contextual appropriateness, and long-term value in realistic tasks
Run experiments on policy improvement using explicit feedback, implicit signals, and model-based grading
Work on long-horizon evaluation problems, where model quality depends not just on a single response but on whether behavior improves outcomes over time
Collaborate closely with safety researchers to ensure that adaptation and personalization remain aligned, interpretable, and bounded by clear constraints
Prototype and iterate quickly on training recipes, reward formulations, data pipelines, and evaluation suites for product-relevant behaviors
Help define how OpenAI measures success for personalized AI systems including trust, appropriateness, and long-term user benefit

Research Engineer/Scientist - Human Alignment, Consumer Devices

Key skills

About this role

Responsibilities: