SID.ai is a research lab focused on search, training models that can retrieve and reason over any data source. They are seeking a Research Engineer to train models, design RL training environments, and manage the training pipeline while working closely with the research CEO.
Responsibilities:
- Train models with GRPO
- Design and iterate RL training environments for retrieval – unstructured, structured, web
- Own the entire training pipeline: from training data curation to wandb
- Run small and large model experiments – yolo runs encouraged
- Work on next-generation vision-first embedding models
- Lead discussions on research – reading group
- Work directly with the ex-research CEO
- Future: Manage a team of research engineers