Inference.net is a company that trains and hosts specialized language models for businesses seeking high-quality AI solutions. The Machine Learning Researcher will conduct research into experimental models and training systems, aiming to develop novel products and improve model performance for customers.
Responsibilities:
- Research and experiment with new model architectures to improve quality, efficiency, or capability
- Explore methods to decrease inference latency and improve serving efficiency
- Run experiments with new learning methods, including novel approaches to SFT, RLHF, DPO, and other post-training techniques
- Perform reinforcement learning research to improve model alignment and capability
- Develop and improve our distillation pipeline for training high-quality models from frontier teachers
- Train models for clients and run evaluations to validate research findings in production settings
- Create robust benchmarks and evaluation frameworks that ensure custom models match or exceed frontier performance
- Stay current with ML research and identify techniques that can improve our platform
- Collaborate with applied engineers to bring successful research into production systems
- Document findings and share knowledge with the team