Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. In this role, you will create cutting-edge datasets for training and benchmarking large language models, collaborating closely with researchers to evaluate and refine AI-generated code for efficiency and reliability.
Responsibilities:
- Create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers
- Curate code examples, provide precise solutions, and make corrections — with a primary focus on Python across backend services, data pipelines, and ML infrastructure, alongside JavaScript (including ReactJS), C/C++, Java, Rust, and Go
- Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable
- Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks
- Build agents and automated verification tools in Python that can verify the quality of code and identify error patterns
- Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them
- Design verification mechanisms that can automatically verify a solution to a software engineering task
Requirements:
- Several years of software engineering experience (3 years or more)
- Strong expertise in Python with deep knowledge of frameworks, tooling, and best practices for building production-grade software
- Experience building full-stack applications and deploying scalable software using modern languages and tools
- Deep understanding of software architecture, design, development, debugging, and code quality/review assessment
- Excellent oral and written communication skills for clear, structured evaluation rationales
- Candidates must be based in the United States
- Ideal for engineers who have built production systems at companies like Google, Microsoft, Apple, Amazon, Meta, or similar high-scale engineering organizations
- Graduates from top computer science programs such as Stanford, MIT, Carnegie Mellon, UC Berkeley, Georgia Tech, and comparable institutions