About this roleAbout the team
As a core member of our Seed Global Data Team, you'll be at the heart of our model training. Gain first-hand experience in understanding the intricacies of training Large Language Models (LLMs) with diverse data sets.
Responsibilities
- Collaborate with technical teams to precisely define the data requirements necessary for aligning Large Language Models (LLMs) across global markets.
- Develop methodologies for data acquisition and production, while also monitoring costs and assessing the effectiveness and efficiency of these processes. Ensure data quality is maintained at high standards and continually enhance processes based on both quantitative and qualitative feedback.
- Evaluate the impact of data production tools on the effectiveness and quality of data production. Work closely with technical teams to enhance both the quality and effectiveness of these tools.
- Proactively identify and mitigate potential biases and vulnerabilities in the data production process. This includes, but is not limited to, addressing human and system biases through the implementation of technical, product, and operational solutions.
The base salary range for this position in the selected city is $165120 - $414000 annually.