San Francisco, California, United States of America
Full Time
2 hours ago
$230,000 - $260,000 USD
H1B Sponsor
Key skills
AI, ML, LLM, Agentic, ECS, Decision Making, Collaboration, Remote Work
About this role
Role Overview
Own the product vision, strategy, and roadmap for the task and Expert Contributor Evaluations product supporting internal and external platform users.
Identify bottlenecks and lead cross-team efforts to improve overall quality and customer acceptance rate.
Lead Evaluation products and tools across research, engineering, operations, and forward-deployed engineering teams to drive excellence in product quality.
Define, build, and set up continuous monitoring for performance metrics of Evaluations to improve their adoption, usage, and effectiveness.
Own the strategy and execution of Expert Contributor Quality.
Build a recommendation system for optimal matching of Expert Contributors (ECs) to tasks on the Snorkel Platform.
Work with cross-functional stakeholders to support enablement and adoption initiatives for the Evaluations and Quality products.
Balance short-term delivery with long-term platform investments to support future growth.
Requirements
5–7 years of experience as a Product Manager, with ownership of complex, cross-functional product areas.
Educational background in computer science or a related engineering discipline.
Strong technical literacy across ML models, Gen AI concepts, LLM-based products, Agentic designs, evaluation, labeling strategies, and quality frameworks.
Proven ability to drive deeply technical roadmaps end-to-end, from concept to launch.
Ability to write clear PRDs, partner on research experimentation frameworks, and drive measurable outcomes.
Experience building internal or platform-level products with complex workflows and multi-stakeholder environments.
Proven ability to own end-to-end product experiences across multiple user personas.
Strong analytical and problem-solving skills, with a track record of metrics-driven decision making.
Excellent collaboration skills and experience partnering closely with Engineering, Research, and Operations teams.