Sciata is seeking a Senior Data Engineer to support a fast-paced SaaS, data-driven product environment. This role will focus on data platform engineering, infrastructure, and ML enablement, with an emphasis on building scalable pipelines and supporting analytics and machine learning workflows.
Responsibilities:
- Design and build scalable batch and streaming data pipelines
- Develop data infrastructure for analytics and ML workflows
- Build secure, scalable APIs for data and model access
- Support event-driven architecture using Kafka and orchestration using Airflow
- Partner with Data Science and ML teams to deliver platform capabilities
- Ensure strong data observability, governance, and system reliability
Requirements:
- Strong recent hands-on experience with Kafka and Airflow
- Experience with SQL and NoSQL databases
- Strong AWS or other cloud platform experience
- API development experience
- Experience with Docker, Kubernetes, Terraform/CloudFormation, CI/CD, and observability tools
- Must be based in the US
- Must be able to work full-time during U.S. business hours
- Experience supporting ML pipelines
- Exposure to SageMaker, OpenAI, or similar tools
- Background in SaaS, data platform, or ML infrastructure environments