Build real stuff on a cloud data platform: contribute to features and improvements across ingestion, transformation, and serving layers
Create & maintain ELT pipelines: bring new data in, transform it reliably, and keep it running as requirements evolve
Turn business questions into dashboards: collaborate with business stakeholders to create and improve reports and visualizations
Help keep these platforms healthy: support deployments, monitoring, and troubleshooting (you’ll learn how “production” really works)
Requirements
SQL fundamentals: you can write queries with joins and aggregations
Programming basics: experience with Python (preferred) or another modern language
Git basics: you understand version control concepts (commits, branches, collaboration)
Motivation: you’re genuinely interested in data engineering and how data powers analytics and business decisions
Curiosity about applied AI: you’re interested in how data enables AI/ML
AI-assisted development mindset: you use AI-powered IDE features and coding agents to accelerate learning and delivery while keeping quality and correctness in check.
You’ve done something practical: you’re pursuing a STEM degree and have at least one of these: relevant university courses, or online courses (Udemy/Coursera), or a data project (university or personal)
Student status: active student with at least 1 full semester remaining
Availability: at least 20 hours/week
Language: conversational English
Nice to have — bonus points, not blockers (curiosity outweighs experience)