About this role

Conduct fundamental LLM research using our SOTA story engine.
Create a benchmark for evaluating LLM behavior by defining tasks, success criteria, and auxiliary metrics within our story engine.
Deliver a benchmark library for performing evaluations on SOTA LLMs and a written report of your compiled results for posting to our blog and potential publication.

Current grad student in computer science or similar
Previous publications or research projects creating LLM benchmarks
Demonstrated software engineering experience via open-source projects or employment
Evidence of written communication skills via previous reports or publications

Medical, dental, and vision coverage, with the company paying 100% of premiums for employees and eligible dependents (subject to plan terms and eligibility; benefits vary by location).
Company-provided laptop (Mac by default) and the tools you need to do your best work.
Flexible PTO, encouraged and supported. Many teammates take around 2 days per month plus additional time each quarter, depending on role and team planning.
Access to the latest AI models to accelerate your work.
$100/month Learning Fund for books, courses, coaching, conferences, and video games.

AI Research Intern

Key skills