Thinking Machines Lab is focused on advancing collaborative general intelligence and empowering humanity through AI. They are seeking a Research Infrastructure Engineer to build and maintain tools that accelerate research, ensuring that researchers have reliable systems that enhance their work.
Responsibilities:
- Design, build, and operate research infrastructure including evaluation frameworks, RL training systems, experiment tracking platforms, visualization tools, and shared utilities
- Develop high-throughput, scalable pipelines for distributed evaluation, reward modeling, and multimodal assessment
- Build systems for reproducibility, traceability, and robust quality control across research experiments and model training runs. Implement monitoring and observability
- Partner directly with researchers to identify bottlenecks and unlock new capabilities. Own research tooling like a product manager, proactively seeking feedback and tracking adoption
- Collaborate with infrastructure, data, and product teams to integrate tools across the technical stack