Centific is a frontier AI data foundry that empowers clients with safe and scalable AI deployment. The Senior Product Manager for LLM Post-Training & Evaluation will own the product strategy and roadmap, translating complex customer requirements into a coherent product vision while collaborating with cross-functional teams to drive delivery and market execution.

Responsibilities:

Define and own the product roadmap for LLM post-training and evaluation capabilities — including benchmark infrastructure, evaluation-as-a-service offerings, post-training workflow tooling, and model quality platforms
Conduct deep discovery with AI labs, enterprise ML teams, and foundation model builders to understand evaluation pain points, post-training needs, and quality measurement gaps
Translate complex technical and business requirements into clear, actionable product specifications
Make rigorous prioritization decisions across a broad research-and-product backlog, balancing customer urgency, strategic value, and engineering feasibility
Partner with Research Scientists, AI/ML Research Engineers, and Language Data Scientists to drive delivery from concept to production
Lead sprint planning, milestone tracking, and cross-team alignment without direct authority
Partner with Sales, Solutions Engineering, and Marketing to develop positioning, packaging, and pricing for evaluation and post-training offerings
Create compelling product narratives and technical collateral for customer-facing engagements
Serve as the product voice in customer conversations with technical stakeholders at leading AI organizations
Gather feedback, validate hypotheses, and iterate rapidly on platform capabilities
Define and track product KPIs for adoption, quality, and customer satisfaction
Use data to drive product decisions and communicate progress to executive leadership
Ensure that evaluation frameworks, benchmark datasets, and post-training pipelines are built as reusable, scalable platform assets rather than one-off solutions

Requirements:

4–5 years of product management experience, with 3+ years in AI/ML, data platform, or developer tools products
Strong understanding of ML concepts, LLM training pipelines, and evaluation methodologies. Able to engage credibly with research scientists and engineers on technical tradeoffs without being the deepest technical expert in the room
BS/MS in Computer Science, Engineering, Statistics, or a related technical field. Equivalent applied experience considered
Demonstrated success taking AI/ML products from early concept through customer adoption, including defining MVPs, iterating on feedback, and scaling to broader use
Strong customer discovery and requirements engineering skills; experience translating ambiguous, complex needs into structured specs and clear prioritization
Proven ability to lead without direct authority — aligning research, engineering, design, and go-to-market teams toward shared outcomes
Exceptional written and verbal communication skills; able to write crisp PRDs, present to executives, and hold your own in deep technical discussions with research scientists
Prior experience as an ML engineer, data scientist, or research engineer before moving into product management
Hands-on experience designing or running LLM evaluation studies, benchmark datasets, or quality measurement frameworks
Exposure to fine-tuning or post-training workflows (SFT, RLHF, preference optimization) in a research or applied setting
Experience working at or closely with AI labs, foundation model companies, or enterprise AI infrastructure providers
Familiarity with the research literature in LLM evaluation, alignment, and post-training; able to read and critically assess papers at top venues (NeurIPS, ICML, ICLR, ACL, EMNLP)
Experience with platform-as-a-service or API-first products in the B2B enterprise space
Familiarity with responsible AI, model governance, and compliance considerations relevant to enterprise GenAI deployments

Senior Product Manager, LLM Post-Training & Evaluation

Key skills

About this role

Responsibilities:

Requirements: