Great Value Hiring is seeking a Software Test Engineer to design verifiers and correctness rubrics for evaluating AI agents' code. The role involves creating adversarial test cases and grading agent trajectories to enhance test quality.
Responsibilities:
- Design verifiers and correctness rubrics for coding tasks
- Enumerate edge cases and build adversarial test cases for agent/model evaluation
- Grade agent trajectories and improve test/rubric quality through review
Requirements:
- Design verifiers and correctness rubrics for coding tasks
- Enumerate edge cases and build adversarial test cases for agent/model evaluation
- Grade agent trajectories and improve test/rubric quality through review
- ~5+ years as an SDET / software test engineer at a real product organization
- Write code and tests: automation frameworks (pytest, Playwright, Cypress), CI/CD (SDET preferred over manual-only QA)
- Clear written communication
- Familiarity with AI tools / evals is a plus