Quik Hire Staffing is looking for a highly skilled technical expert to evaluate and improve AI-generated responses across software engineering, data science, and systems design topics. The role involves assessing model outputs for accuracy and clarity, and providing structured feedback to enhance AI performance.

Responsibilities:

Evaluate LLM-generated responses to coding, software engineering, data science, and systems design questions
Verify factual accuracy using reliable public sources and authoritative references
Execute code and validate outputs to confirm correctness and reproducibility
Review model responses for strengths, weaknesses, logical flaws, and conceptual errors
Assess code quality, readability, algorithmic soundness, and explanation quality
Ensure responses follow expected conversational behavior and evaluation guidelines
Apply consistent review standards using clear benchmarks and taxonomies
Produce structured feedback that helps improve AI model performance

Requirements:

Strong expertise in at least two programming languages
Ability to solve medium and hard technical problems independently
Experience contributing to open-source projects or production codebases
Strong familiarity with using LLMs in coding workflows
Excellent attention to detail and the ability to identify subtle bugs or reasoning issues
Strong written communication skills for technical evaluation and feedback
Prior experience with AI evaluation, RLHF, or data annotation
Competitive programming experience
Experience reviewing code in production environments
Familiarity with multiple programming paradigms or ecosystems
Ability to explain complex technical concepts clearly to non-expert audiences

Software Engineer (Data Science & Systems Design) - Remote

Key skills

About this role

Responsibilities:

Requirements: