Cartesia is pioneering the next generation of AI with a focus on interactive intelligence across various media. They are seeking a Research Engineer to ensure the quality and coverage of data for their models, emphasizing the creation of inclusive and representative datasets.
Responsibilities:
- Design and build large-scale datasets for model training, and run controlled modeling experiments to measure their impact on model performance and behavior
- Build evaluations of speech models, both via manual annotation and at scale with automated metrics
- Implement techniques for steering data generation to improve model intelligence through data and mitigate bias
- Build automated quality control systems to validate and filter generated data
- Partner with product teams to ensure support for key languages and markets