Blue River Technology is a team of innovators aiming to transform agriculture through intelligent machinery. They are seeking a Data Engineer to manage data pipelines and support the ML development lifecycle, collaborating with various teams to enhance agricultural solutions.
Responsibilities:
- Own data pipelines end-to-end — ingestion, quality, performance, and reliability
- Optimise queries and shape data architecture across the ML development lifecycle
- Build and maintain ETL workflows that feed annotation, exploration, and model training
- Support infrastructure scalability across AWS (S3, DynamoDB, Lambda, SQS, and more)
- Improve code quality through automation, testing, and peer reviews
- Collaborate directly with roboticists, backend engineers, and platform teams
Requirements:
- 2+ years in backend engineering / data infrastructure
- Strong Python skills
- Hands-on experience with ETL pipelines and data architecture
- Solid grasp of relational and non-relational databases
- AWS services (S3, DynamoDB, EC2, Lambda, ECR, SQS, SNS)
- Docker + CI/CD (GitHub Actions or Jenkins)
- Clear communicator who documents well and works cross-functionally
- BS or MS Computer Science
- Databricks / Apache Spark
- Kubernetes in production
- Terraform or CloudFormation