The UVA VEC is focused on a high-impact AI initiative aimed at improving the reliability and performance of conversational systems in software engineering contexts. They are seeking an experienced software engineer with expertise in C# to evaluate AI-generated code and ensure high-quality model behavior through detailed analysis and feedback.

Responsibilities:

Evaluate AI-generated responses to coding and software engineering tasks for accuracy and reasoning quality
Execute code to validate outputs and verify functional correctness
Analyze algorithm design, efficiency, and edge case handling
Annotate responses with detailed feedback on strengths and deficiencies
Identify logical errors, bugs, and inconsistencies in generated code
Assess clarity and completeness of technical explanations
Apply standardized evaluation frameworks, taxonomies, and benchmarks
Ensure outputs align with expected conversational and engineering standards

Requirements:

Bachelor's, Master's, or PhD in Computer Science or a related field
5+ years of professional experience in software engineering or similar roles
Advanced proficiency in C# and strong understanding of software design principles
Ability to independently solve medium to hard-level algorithmic problems
Experience executing and debugging code across real-world scenarios
Strong analytical skills for evaluating logic, performance, and correctness
Familiarity with large language models and their practical limitations
High attention to detail and structured evaluation approach
Fluent English communication skills
Experience contributing to open-source projects with accepted pull requests
Exposure to model evaluation, RLHF, or data annotation workflows
Background in competitive programming or technical assessments
Experience reviewing production-level codebases
Familiarity with multiple programming paradigms or ecosystems
Ability to clearly explain technical concepts to non-technical audiences

Senior C# Software Engineer (AI Evaluation)

Key skills

About this role

Responsibilities:

Requirements: