Centific is a frontier AI data foundry that empowers clients with safe and scalable AI deployment. The Senior Product Manager for LLM Post-Training & Evaluation will own the product strategy and roadmap, translating complex customer requirements into a coherent product vision while collaborating with cross-functional teams to drive delivery and market execution.
Responsibilities:
- Define and own the product roadmap for LLM post-training and evaluation capabilities — including benchmark infrastructure, evaluation-as-a-service offerings, post-training workflow tooling, and model quality platforms
- Conduct deep discovery with AI labs, enterprise ML teams, and foundation model builders to understand evaluation pain points, post-training needs, and quality measurement gaps
- Translate complex technical and business requirements into clear, actionable product specifications
- Make rigorous prioritization decisions across a broad research-and-product backlog, balancing customer urgency, strategic value, and engineering feasibility
- Partner with Research Scientists, AI/ML Research Engineers, and Language Data Scientists to drive delivery from concept to production
- Lead sprint planning, milestone tracking, and cross-team alignment without direct authority
- Partner with Sales, Solutions Engineering, and Marketing to develop positioning, packaging, and pricing for evaluation and post-training offerings
- Create compelling product narratives and technical collateral for customer-facing engagements
- Serve as the product voice in customer conversations with technical stakeholders at leading AI organizations
- Gather feedback, validate hypotheses, and iterate rapidly on platform capabilities
- Define and track product KPIs for adoption, quality, and customer satisfaction
- Use data to drive product decisions and communicate progress to executive leadership
- Ensure that evaluation frameworks, benchmark datasets, and post-training pipelines are built as reusable, scalable platform assets rather than one-off solutions
Requirements:
- 4–5 years of product management experience, with 3+ years in AI/ML, data platform, or developer tools products
- Strong understanding of ML concepts, LLM training pipelines, and evaluation methodologies. Able to engage credibly with research scientists and engineers on technical tradeoffs without being the deepest technical expert in the room
- BS/MS in Computer Science, Engineering, Statistics, or a related technical field. Equivalent applied experience considered
- Demonstrated success taking AI/ML products from early concept through customer adoption, including defining MVPs, iterating on feedback, and scaling to broader use
- Strong customer discovery and requirements engineering skills; experience translating ambiguous, complex needs into structured specs and clear prioritization
- Proven ability to lead without direct authority — aligning research, engineering, design, and go-to-market teams toward shared outcomes
- Exceptional written and verbal communication skills; able to write crisp PRDs, present to executives, and hold your own in deep technical discussions with research scientists
- Prior experience as an ML engineer, data scientist, or research engineer before moving into product management
- Hands-on experience designing or running LLM evaluation studies, benchmark datasets, or quality measurement frameworks
- Exposure to fine-tuning or post-training workflows (SFT, RLHF, preference optimization) in a research or applied setting
- Experience working at or closely with AI labs, foundation model companies, or enterprise AI infrastructure providers
- Familiarity with the research literature in LLM evaluation, alignment, and post-training; able to read and critically assess papers at top venues (NeurIPS, ICML, ICLR, ACL, EMNLP)
- Experience with platform-as-a-service or API-first products in the B2B enterprise space
- Familiarity with responsible AI, model governance, and compliance considerations relevant to enterprise GenAI deployments