Alignerr is assembling an elite team of senior professionals to stress-test frontier AI models. The role calls for deep domain knowledge: you will craft expert-level prompts and evaluate AI responses, with the goal of improving model performance through rigorous challenges.
Responsibilities:
- Design sophisticated, expert-level prompts that test the boundaries of frontier AI models in your domain
- Evaluate AI-generated responses for factual accuracy, depth, nuance, and reasoning quality
- Identify subtle errors, hallucinations, and gaps that only a true domain expert would catch
- Provide detailed, structured feedback to help improve model performance
- Develop novel and adversarial test cases that expose model weaknesses
- Collaborate asynchronously with a global network of top-tier experts across disciplines
- Work independently on your own schedule with full autonomy over your tasks
Requirements:
- 5+ years of professional or academic experience in any domain — the field doesn't matter, the depth does
- Demonstrated track record of expert-level work (publications, senior roles, advanced degrees, industry recognition, or equivalent)
- Exceptional critical thinking and analytical skills
- Ability to articulate complex, domain-specific concepts with precision
- Strong written communication skills in English
- Intellectual curiosity and a genuine interest in how AI handles your area of expertise
- Comfortable working independently in a remote, asynchronous environment
- No prior AI or prompt engineering experience required — your domain mastery is what matters
Preferred:
- Experience with AI tools, large language models, or prompt engineering
- Background in teaching, mentoring, or explaining complex topics to non-experts
- Published research, patents, or recognized thought leadership in your field
- Experience in quality assurance, peer review, or editorial work