Position: Data Lake Engineer
Local candidates only; no relocation
Experience: 8+ years
Independent candidates only
Location: Chicago, IL 60606 (ideally 2-3 days onsite)
Job Description:
This position joins a growing team that supports the data lake repository and related ETL processes using Fivetran, Databricks, and other technologies. In addition to building data lake systems, the role is responsible for data ETL, data curation, and data quality checks. The position supports the Data Engineering team and works in an Agile environment.
Duties:
- The ideal candidate will have a solid background in database and data lake development using technologies such as Fivetran, Databricks, Spark, and AWS data lake services, along with traditional technologies such as ETL tooling and MS SQL Server. The position is part of a team building an end-to-end data lake and data pipelines on AWS using Agile project management methodologies.
- Design and implement end-to-end Data Lake solutions using the technologies listed above, as well as:
  - Data integrity processes using Databricks and related tools/processes
  - Integrating the Data Lake with third-party application APIs for downstream queries
  - Access design and integrations for Data Lake users and applications
- Experience with AWS and related data technologies and concepts in the following areas is required:
  - Python
  - Athena
  - Glue/Crawlers
  - Lake Formation
  - ETL
  - Spark (PySpark a plus)
  - Workflows & automation
  - Triggers
  - Lambda Functions
  - IAM data lake security concepts/permissions
- Data onboarding: define onboarding procedures and work with business stakeholders to onboard new data sources.
Requirements:
- 5+ years of database and data management experience.
- Fivetran, Databricks, AWS or other related data certifications are highly preferred.
- Understanding of cloud service models such as IaaS and SaaS.
- Experience working in an Agile environment.
- Four-year college degree in information technology, or equivalent experience.