Trafilea is a Consumer Tech Platform for Transformative Brand Growth, building the AI Growth Engine that powers the next generation of consumer brands. They are seeking a Senior Data Engineer to architect and scale the data infrastructure behind their Machine Learning platform, with a focus on designing systems that turn large volumes of data into production-ready models.
Responsibilities:
- Architect and scale advanced ETL pipelines using modern Big Data technologies
- Design resilient data frameworks across the full lifecycle: extraction → transformation → ML modeling
- Lead the development of Airflow-driven data workflows
- Optimize large-scale datasets and complex SQL queries
- Operationalize machine learning models in batch and real-time environments
- Improve cost-efficiency and scalability across the AWS data ecosystem
- Elevate data quality standards with strong governance and monitoring systems
- Build internal data tools that empower Marketing Science & Analytics teams
Requirements:
- 2–3+ years of experience as a Data Engineer or ML Engineer in production environments
- You think in systems, not scripts
- You care about scalability, cost-efficiency, and precision
- You reject mediocrity: performance and reliability matter
- You document your work and raise the bar for quality
- AWS ecosystem: S3, Glue, Athena, Redshift, Lambda, EC2, RDS, EMR, VPC, ECS/EKS
- Apache Airflow for orchestration
- Python (object-oriented programming)
- Advanced SQL optimization
- CI/CD with GitHub Actions or GitLab
- Docker & container orchestration (Kubernetes on EKS, ECS), with ECR as the image registry