Rocket Lawyer is the largest and most widely used online legal service platform in the world, aiming to enhance and expand its platform to capture audiences worldwide. They are seeking a Data Engineering Intern to build robust ETL processes and leverage AI technologies to automate workflows, supporting Business Intelligence efforts and influencing the product roadmap with actionable insights.
Responsibilities:
- Build & Orchestrate: Design, develop, and maintain ETL pipelines to ingest data into our Snowflake warehouse using Python, SQL, and Airflow
- AI-Driven Automation: Implement AI-powered solutions to streamline engineering tasks, including:
- Automating code generation and documentation
- Building AI-driven data quality checks and anomaly detection
- Developing "self-healing" pipelines that can identify and alert on ingestion errors
- Insight Generation: Use Jupyter Notebooks and Streamlit to analyze data and build internal tools that help our product team make data-driven decisions
- Visualization: Create high-impact dashboards in Tableau that translate complex data into a clear narrative for stakeholders
- Agile Collaboration: Participate in daily Scrum huddles, manage tasks via Jira, and work closely with product owners and QA to promote code to production
- Cloud Infrastructure: Interact with cloud services via CLI and manage containerized environments using Docker and Kubernetes
Requirements:
- Currently pursuing an undergraduate degree with a targeted graduation date in 2026 or early 2027, or pursuing a degree in Computer Science, Data Science, or a related quantitative field
- Expertise in Python and SQL
- A strong understanding of Data Warehousing (Snowflake) and ETL orchestration (Airflow)
- Familiarity with CLI, Docker, and Kubernetes for managing cloud-based environments
- Experience with Jupyter Notebooks, Tableau, or Streamlit
- A proactive approach to using AI/LLMs to automate repetitive tasks and improve system reliability