McGraw Hill is a company focused on building meaningful enterprise learning solutions for students and institutions. The Lead Data Engineer role involves transforming various data sources into actionable insights and overseeing platform performance while architecting scalable reporting and analytics solutions.
Responsibilities:
- Develop and deliver scalable data solutions using AWS and/or Oracle technologies, leveraging strong data modeling skills, SCD practices, and modern cloud architecture
- Translate business requirements into technical designs, creating detailed specifications, solution documentation, and end-to-end integration models (facts, dimensions, star schemas, aggregations)
- Build optimized, parallel-processing ETL pipelines using Informatica and advanced transformations, supported by Unix scripting for workflow automation, cleanup, and file management
- Manage and enhance cloud platform performance to meet daily runbook SLAs, ensuring reliable, efficient data operations across financial and operational systems like Oracle ERP and Oracle DB
- Provide technical leadership through the full software development lifecycle, offering guidance from requirements through implementation while supporting Agile/Kanban delivery practices
- Utilize Git, Jira, and other engineering tools to maintain version control, collaborate effectively, and support continuous improvement in data engineering processes
Requirements:
- 8+ years of data engineering experience using Informatica/IICS, Oracle DB, Oracle packages, AWS data services such as Athena with Iceberg, Lambda, EMR, Glue, and platforms like Databricks
- Expertise in data warehousing, modern data lake architectures, and large-scale data engineering practices
- Strong programming skills in Python, Scala, Java, or Node, along with at least 1 year of Unix shell scripting
- 5+ years working with cloud environments including OCI, AWS, or Azure with a focus on data technologies
- Experience working with financial datasets such as sales, revenue, COGS, and manufacturing, and familiarity with Tableau or Alteryx
- Preferred experience in the publishing or education domain
- Experience with IBM Planning Analytics (TM1)