Genesis AI is committed to building general-purpose Physical AI and is seeking a Staff Software Engineer to design and maintain large-scale data pipelines for robotics model training. The role involves owning core data infrastructure and working with a dedicated team to standardize data models and unify processing pipelines.
Responsibilities:
- Design, build, and maintain large-scale data pipelines (batch and streaming) for robotics foundation model training and evaluation at petabyte scale
- Own core data infrastructure: data model, storage systems, ingestion pipelines, transformation frameworks, and orchestration layers
- Standardize data models and unify processing pipelines across real-world teleoperation and synthetic simulation datasets
- Collaborate with a team of driven individuals committed to building general-purpose Physical AI
Requirements:
- Excellent software engineering skills in Python, Go, or a similar language
- 8+ years of experience designing, building, and maintaining large-scale data pipelines
- Deep understanding of distributed systems (Spark, Kafka, or similar)
- Extensive experience with data storage technologies (data lakes, warehouses, object stores like S3)
- Experience running and maintaining production-grade infrastructure (Kubernetes, Terraform)
- Experience supporting AI systems, particularly embodied AI such as self-driving vehicles