Stealth Mode Startup is an early-stage technology company focused on building a real-time simulation platform using multi-agent AI and machine learning. The Computer Vision Engineer will own and improve the pipeline that extracts 3D pose and trajectory data from basketball broadcast footage, ensuring the accuracy and reliability of the training data for downstream models.
Responsibilities:
- You'll own the pipeline that turns raw basketball broadcast footage into clean, accurate 3D pose and trajectory data
- That means fine-tuning and improving our HMR 2.0-based pose estimation, optimizing player detection and tracking, building or training specialized models for court keypoint detection, and refining our homography estimation to accurately place 3D pose data onto the court
- You'll be responsible for the accuracy and reliability of every piece of training data that feeds our downstream diffusion models
- When extraction quality degrades—occlusions, camera cuts, unusual angles—you'll diagnose the problem and fix it, whether that means fine-tuning an existing model, training a new one, or engineering around the issue