Probably Genetic is changing the lives of patients living with severe, complex diseases. They are seeking a Senior Data Scientist to own the development and operationalization of predictive diagnostic AI models, enhancing patient diagnosis and program efficiency.
Responsibilities:
- Own the end-to-end development, validation, and operationalization of PG's predictive diagnostic AI models — from feature engineering through production deployment – that power program eligibility decisions and clinical decisions for patients
- Run prospective testing experiments: apply diagnostic models to undiagnosed patients, coordinate testing, and track outcomes to continuously improve model performance
- Build and maintain PG's synthetic patient data pipeline, a critical deliverable for our research programs, and key input to our own model development lifecycle
- Optimize our patient intake experience using NLP and multimodal data analysis to determine which questions to ask, in what order, to maximize data quality and conversion
- Own API usage and cost optimization across PG's AI stack, including prompt engineering, model evaluation, and ongoing performance monitoring
- Conduct ad hoc strategic analyses that inform product prioritization, causality assessment, and generate customer-facing program insights
- Establish MLOps infrastructure: model monitoring, drift detection, API observability, and lightweight but durable operational processes
- Have the freedom to conduct blue sky research initiatives aimed at creating value from our data
- Work with Data Engineering to build a robust, scalable data foundation that supports all of the above