The Hanover Insurance Group is a company that has been delivering on its promises for over 170 years. They are seeking a Data Engineer II to manage the reliability and enhancement of Azure Data Factory pipelines and SQL Server data environment, focusing on building ETL workflows and improving data quality.
Responsibilities:
- Production support and daily health of scheduled ADF pipelines
- Build new pipelines and enhance existing pipelines to improve resiliency, maintainability, and scalability
- Implement data validation controls, improve monitoring/alerts, and help define SLAs for data freshness and availability
- Establish foundational SDLC practices for data engineering (Git usage, Dev→Prod promotion practices, and a more formal release process)
- Coordinate cross-team dependencies where upstream internal ETL timelines affect downstream pipeline completion; design dependency-aware orchestration and readiness checks
- Contribute to future-state Azure data strategy recommendations (e.g., Data Lake/Blob Storage, notebooks) and support long-term migration planning from on-prem SQL Server to cloud databases (timeline not yet defined)
- Design and maintain curated SQL tables/views used for analytics and reporting; optimize for refresh performance and downstream usability
- Develop and maintain ADF pipelines (Copy activities and Data Flows) with consistent patterns for logging, error handling, retries, and notifications
- Implement parameterization and reusable components to reduce duplication and speed enhancements
- Implement incremental load and backfill strategies appropriate to volume
- Monitor daily pipeline execution and triage incidents quickly to restore successful processing
- Perform root-cause analysis for failures and recurring issues; implement preventative controls and standardized patterns
- Create and maintain operational documentation/runbooks for critical pipelines and common support scenarios
- Build automated validation routines and reconciliation checks (row counts, totals, null/duplicate thresholds, schema drift detection, anomaly flags)
- Partner with analysts and stakeholders to define key business rules and quality thresholds for trusted reporting datasets
- Document data definitions, transformations, and lineage to improve transparency and troubleshooting
- Work directly with internal Operations Data Analysts, business stakeholders and external partner analysts to gather requirements and deliver datasets that enable robust Power BI semantic models, reporting and other analytical solutions
- Coordinate with upstream internal teams to align dependency readiness signals and timelines; implement orchestration controls to prevent downstream failures
- Provide technical coaching and mentoring to less experienced team members; including the usage of best practices
Requirements:
- Bachelor's Degree preferred in a related field
- 6+ years of experience in data engineering, ETL/ELT, or related roles
- Hands-on experience building and supporting production pipelines in Azure Data Factory
- Strong SQL/T-SQL skills and experience with SQL Server environments supporting analytics/reporting workloads
- Advanced data modeling
- Experience integrating data from multiple sources including databases, flat files/SFTP, and APIs/vendor feeds
- Familiarity with Azure data platform components (Data Lake/Blob Storage, notebooks) and/or cloud migration planning, or similar platforms
- Strong troubleshooting, root-cause analysis, and operational ownership mindset
- Ability to work directly with stakeholders to gather requirements and translate them into technical solutions
- Strong communication and documentation skills: Data engineers are often called to present their findings or translate the data into an understandable document. You will need to write and speak clearly, easily communicating complex ideas
- Experience establishing SDLC practices for data teams (Git, CI/CD, release management)
- Experience implementing monitoring/alerting and operational dashboards for data pipelines
- Python and/or PowerShell for automation, data validation, and operational tooling
- Insurance or operations/customer service experience
- Familiarity with prompt-based programming and tools (GitHub CoPilot)