Talentsearchpro is seeking a Research Scientist to work on Frontier Data focusing on LLM training pipelines. The role involves designing experiments, developing evaluation rubrics, and partnering with research teams to enhance model capabilities across various domains.
Responsibilities:
- Design data slices and explore data shapes that expose meaningful model failure modes across domains, including finance, code, and enterprise workflows
- Build and refine evaluation rubrics and reward signals for RLHF and RLVR training pipelines
- Model annotator behavior and run experiments to improve different model capabilities
- Develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on model alignment and capability
- Partner with lab research teams to translate their training objectives into concrete data and evaluation specifications
- Move fast from hypothesis to experiment, extract actionable insights from messy results, and iterate quickly