Protege is a company focused on solving the challenges of accessing the right training data for AI. The Product Manager will be responsible for managing the data supply, building a unified system for data representation, and keeping cross-functional teams aligned.
Responsibilities:
- Define the data model across title, asset, and partner levels
- Establish a clear state model (e.g., ingested, accessible, in pipeline, linked/enriched)
- Own search and discovery across modalities; set metadata standards with Data Lab + Engineering
- Ensure catalog state is auditable and resilient to refreshes, moves, and deletions
- Create structured visibility into partner supply not yet ingested (potential, cadence, modality coverage, volume estimates)
- Enable GTM to scope deals based on available + accessible supply, not only what’s already in-platform
- Provide visibility into partner inclusion in deals, utilization trends, and inventory footprint
- Reduce partner back-and-forth caused by unclear system truth
- Partner with Partnerships to ensure relationships are supported by scalable system representations
- Work closely with Privacy, Rights & Trust to represent data eligibility and constraints
- Work closely with Data Access & Delivery to ensure discoverable supply is deliverable
- Work closely with Solutions Architecture to identify catalog gaps surfaced by deals
- Work closely with Engineering, which owns ingestion execution and infrastructure
Requirements:
- PM experience in data platforms, marketplaces, catalogs/search, or supply-side systems
- Strong information architecture instincts and the ability to design durable abstractions
- Comfortable driving cross-functional alignment across Product, Engineering, Partnerships, and Data teams