Quik Hire Staffing is looking for a highly skilled technical expert to evaluate and improve AI-generated responses across software engineering, data science, and systems design topics. The role involves assessing model outputs for accuracy and clarity, and providing structured feedback to enhance AI performance.
Responsibilities:
- Evaluate LLM-generated responses to coding, software engineering, data science, and systems design questions
- Verify factual accuracy using reliable public sources and authoritative references
- Execute code and validate outputs to confirm correctness and reproducibility
- Review model responses for strengths, weaknesses, logical flaws, and conceptual errors
- Assess code quality, readability, algorithmic soundness, and explanation quality
- Ensure responses follow expected conversational behavior and evaluation guidelines
- Apply consistent review standards using clear benchmarks and taxonomies
- Produce structured feedback that helps improve AI model performance
Requirements:
- Strong expertise in at least two programming languages
- Ability to solve medium and hard technical problems independently
- Experience contributing to open-source projects or production codebases
- Strong familiarity with using LLMs in coding workflows
- Excellent attention to detail and the ability to identify subtle bugs or reasoning issues
- Strong written communication skills for technical evaluation and feedback
- Prior experience with AI evaluation, RLHF, or data annotation
- Competitive programming experience
- Experience reviewing code in production environments
- Familiarity with multiple programming paradigms or ecosystems
- Ability to explain complex technical concepts clearly to non-expert audiences