Hark is an artificial intelligence company developing advanced, personalized intelligence systems. The Member of Technical Staff - Mid-Training will lead the development of training strategies to enhance model capabilities in reasoning, planning, and tool use at scale.
Responsibilities:
- Design and implement mid-training strategies to improve agent capabilities such as reasoning, planning, tool use, and long-horizon decision-making
- Scale synthetic data generation pipelines (e.g., coding, agent trajectories, multimodal data) and optimize data mixtures to improve downstream RL performance
- Build and optimize distributed training pipelines for large models, ensuring efficiency, stability, and scalability across GPU clusters
- Develop and iterate on evaluation frameworks to measure model capability (e.g., task success, reasoning quality, tool use accuracy) and guide training improvements
- Conduct rigorous experimentation and ablations to understand training dynamics, scaling behavior, and bottlenecks
- Collaborate cross-functionally with pre-training, post-training, and product teams to align model development with real-world agent use cases
- Drive technical innovation in areas such as long-context learning, data distillation, and training efficiency, while contributing to the overall model roadmap