Collinear AI is a company focused on advancing AI reliability through innovative simulation environments. They are seeking Research Scientists and Research Engineers to develop high-fidelity environments and evaluation stacks for AI labs, bridging the gap between research and production engineering.
Responsibilities:
- Build Agentic Environments: Design and implement the next generation of "SimLabs", ultra-realistic, long-horizon simulation environments where agents learn to navigate ambiguity and maintain context
- Programmatic Verification: Develop rigorous, policy-aware judges and evaluations that measure genuine capability and safety beyond simple benchmarks
- Close the Loop: Design and execute high-quality post-training runs (CPT, SFT, RL) to deliver frontier performance on open-source models using curated, high-signal data
- Rapid Iteration: Debug and iterate across the full ML stack, from infrastructure to model behavior, ensuring our tools remain "command-line first" and developer-friendly
- Collaborate: Work daily with the founders and research staff to shape the roadmap and push the state-of-the-art in AI reliability