Preference Model is focused on building automated ML research engineering to enhance AI capabilities. As a Member of Technical Staff - Software Engineer on the Capabilities team, you will design and build software engineering environments that challenge AI models and help them improve in complex coding tasks.
Responsibilities:
- Design, build, and refine RL tasks across their full lifecycle, from ideation through grading, failure analysis, and iteration
- Own the hardest environments on the roadmap: multi-step workflows, realistic stakeholder interactions, large codebases with real conventions and technical debt, and system design problems
- Direct coding agents heavily in your day-to-day work, evaluate their output critically, and recognize when they are failing in subtle ways
- Distinguish genuine model capability gaps from grader or environment issues, and redesign tasks to target deeper, more subtle failure modes
- Contribute to the shared infrastructure and tooling that the environments team depends on
- Mentor newer engineers on the team as it grows