Level AI is a fast-growing Series C AI company focused on building production-grade Agentic AI systems. They are seeking a Research Intern specializing in Reinforcement Learning to design and build RL environments and agents that model real-world customer interactions, utilizing real-world data and feedback loops.
Responsibilities:
- Design and build reinforcement learning environments that model real-world customer interaction workflows
- Design RL agents that learn from these environments using real-world interaction data, rewards, and feedback loops
- Define reward models and feedback loops using real-world signals (outcomes and human feedback)
- Enable learning from production data by structuring interaction traces into training-ready datasets for offline and online learning
- Experiment with multi-agent systems and simulation frameworks for complex coordination and decision-making
- Collaborate with engineering and product teams to deploy, evaluate, and iterate on learning systems in production at scale