Protege is building a platform to facilitate the secure and efficient exchange of AI training data. The Product Manager will own the representation and discovery of data supply, build an internal data catalog, and ensure cross-functional alignment to improve data accessibility and partner relationships.
Responsibilities:
- Define the data model across title, asset, and partner levels
- Establish a clear state model (e.g., ingested, accessible, in pipeline, linked/enriched)
- Own search and discovery across modalities; set metadata standards with Data Lab + Engineering
- Ensure catalog state is auditable and resilient to refreshes, moves, and deletions
- Create structured visibility into partner supply not yet ingested (potential, cadence, modality coverage, volume estimates)
- Enable GTM to scope deals based on available + accessible supply—not only what’s already in-platform
- Provide visibility into partner inclusion in deals, utilization trends, and inventory footprint
- Reduce partner back-and-forth caused by unclear system truth
- Partner with Partnerships to ensure relationships are supported by scalable system representations
- Work closely with: Privacy, Rights & Trust: represent data eligibility and constraints, Data Access & Delivery: to ensure discoverable supply is deliverable, Solutions Architecture: to identify catalog gaps surfaced by deals, Engineering: owns ingestion execution and infrastructure