Analyze, design, develop, test, review, document, and troubleshoot data pipeline and ELT solutions against multiple structured and unstructured data sources.
Support our team of analysts through developing requirements and delivering solutions.
Develop code to scrape public websites for data and perform ELT processes.
Maintain, monitor, and support production ELT processes and respond to error and emergency issues.
Requirements
Excellent knowledge of and experience with Big Data concepts such as data lakes, data warehouses, ELT strategies, and best practices.
Strong understanding of relational and dimensional data modeling.
Strong analytical and problem-solving skills.
Experience with dbt and SQL, and proficiency in Python.
Extensive experience with cloud-based data processing and warehousing technologies (Databricks, Snowflake, etc.).
Experience with Lean and Agile development methodologies (such as Kanban or Scrum).
Comfortable in entrepreneurial, self-starting, and fast-paced environments, working both independently and with highly skilled teams.
Experience with other Big Data processing technologies and cloud services (AWS, GCP, Snowflake, Hive, Hadoop, MS SQL, etc.).
Experience with Jira and similar organizational tools.
Experience building web-scraping tools against publicly available datasets (considered an asset).
Experience with GIS/geospatial data processing, integration, and analysis (considered an asset).
Experience building or supporting data visualizations (considered an asset).
Deep intellectual curiosity with a results-focused, relentless pursuit of answers. Ability to work in a fast-paced start-up environment and to embrace change and ambiguity.
Hunger to learn and contribute. Our organization is growing, and this role is an exceptional opportunity to grow with us.
Tech Stack
AWS
Cloud
Google Cloud Platform
Hadoop
Python
SQL
Benefits
Comprehensive health, dental, and vision benefits
Savings program with company matching
Learning and development budget
Generous time off including paid vacation and sick days