Great Value Hiring is seeking a Software Test Engineer to design verifiers and correctness rubrics for evaluating AI agents' code. The role involves creating adversarial test cases and grading agent trajectories to enhance test quality.

Responsibilities:

Design verifiers and correctness rubrics for coding tasks
Enumerate edge cases and build adversarial test cases for agent/model evaluation
Grade agent trajectories and improve test/rubric quality through review

Requirements:

Design verifiers and correctness rubrics for coding tasks
Enumerate edge cases and build adversarial test cases for agent/model evaluation
Grade agent trajectories and improve test/rubric quality through review
~5+ years as an SDET / software test engineer at a real product organization
Write code and tests: automation frameworks (pytest, Playwright, Cypress), CI/CD (SDET preferred over manual-only QA)
Clear written communication
Familiarity with AI tools / evals is a plus

Software Test Engineer

Key skills

About this role

Responsibilities:

Requirements: