Russell Tobin is seeking a Software Engineer to work with large machine learning datasets. The role involves building software pipelines and backend services to enhance data exploration and visualization of curated datasets.
Responsibilities:
- Study large machine learning datasets and build software pipelines to curate machine learning datasets generated from various data collections
- Build backend services to enhance data exploration over the curated machine learning datasets
- Build data visualization solutions (videos, 3d assets, etc.) to represent data samples with improved visibility and reasonable rendering cost
Requirements:
- Proficiency with programming and query languages such as Python and SQL
- Proficiency with data pipeline, modeling, database and query tools, like Dagster, PostgreSQL, MongoDB, Trino or equivalent
- Experiences with visualizations that enhances data exploration. Preferably data visualizations of video data and 3d point clouds/meshes
- Preferably experiences with machine learning training/validation/QA workflow
- Preferably experiences with model debugging workflow. Able to understand what analytics of datasets are typically sought after, during model debugging
- 2+ years of experiences
- B.S. in Computer Science and/or extra experiences in equivalent field