Home
Jobs
Saved
Resumes
Software Engineer – Model Evaluation, Benchmarking at SPREEAI | JobVerse
JobVerse
Home
Jobs
Recruiters
Companies
Pricing
Blog
Jobs
/
Software Engineer – Model Evaluation, Benchmarking
SPREEAI
Website
LinkedIn
Software Engineer – Model Evaluation, Benchmarking
United States
Full Time
9 hours ago
Apply Now
Key skills
Java
Numpy
Pandas
Python
C++
C
AI
Machine Learning
ML
NumPy
CI/CD
About this role
Role Overview
Build automated evaluation pipelines for multimodal AI models.
Benchmark diffusion models, vision systems, and generative workflows.
Validate model checkpoints and detect regressions across versions.
Develop evaluation metrics for realism, consistency, and performance.
Integrate evaluation tooling into CI/CD workflows.
Collaborate with ML researchers and infrastructure teams to ensure production readiness.
Analyze failure modes and propose evaluation strategies.
Requirements
Degree in Computer Science, AI, Engineering, or comparable combination of education and practical experience.
Strong programming skills in Python.
Familiarity with object-oriented programming (C++, Java, Python, or similar).
Strong data structures and algorithms fundamentals.
Understanding of machine learning experimentation workflows.
Experience evaluating vision or generative models.
Familiarity with HuggingFace ecosystem or open-source ML toolkits.
Experience building automated test frameworks or benchmarking tools.
Knowledge of diffusion models or multimodal architectures.
Experience with data analysis tools (NumPy, Pandas, visualization libraries).
Tech Stack
Java
Numpy
Pandas
Python
Benefits
Health insurance
Professional development opportunities
Flexible work arrangements
Apply Now
Home
Jobs
Saved
Resumes