Turing, based in San Francisco, California, is a leading research accelerator for frontier AI labs. The company is seeking a Software Engineer specializing in AI Research & Evaluation to create datasets for training large language models and to collaborate with researchers on improving AI-driven coding solutions.
Responsibilities:
- Create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers
- Curate code examples, provide precise solutions, and make corrections across the full stack: Python for backend and ML workflows, JavaScript (React, Node.js) for frontend and API layers, and C/C++, Java, Rust, and Go
- Evaluate and refine AI-generated code for efficiency, scalability, and reliability
- Work with cross-functional teams to enhance enterprise-level AI-driven coding solutions
- Build agents that verify code quality and identify error patterns across full-stack applications
- Form hypotheses about each step of the software engineering lifecycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on each
- Design mechanisms that automatically verify solutions to software engineering tasks