ICF is a global advisory and technology services provider, and they are seeking a Senior Data Engineer with a focus on data and AI governance. The role involves building and operating reliable data pipelines, designing AI agents, and implementing data governance practices to ensure traceability and security in data operations.
Responsibilities:
- Design, build, and operate batch and streaming data pipelines on modern cloud data platforms
- Develop robust ETL/ELT processes using SQL, Python, and PySpark with strong error handling, monitoring, and cost awareness
- Implement layered / medallion data architectures and analytics-ready data models to support BI and AI workloads
- Partner with analysts and data scientists to deliver trusted, production-grade data assets
- Lead the implementation and evolution of data governance practices using Microsoft Purview or comparable governance platforms
- Define and operationalize governance standards for data products, AI/ML features, and agent-based workflows
- Ensure governance is embedded into platform workflows rather than handled as an after-the-fact process
- Act as a de facto owner for data and AI governance patterns as standards continue to evolve
- Design and build AI agents and intelligent automation workflows that support data platform and governance operations
- Develop agents to assist with metadata enrichment, data quality checks, anomaly detection, and governance workflows
- Ensure AI agents operate only on approved, governed data sources with appropriate logging and auditability
- Contribute to evolving AI governance patterns, including agent lifecycle management and input/output traceability
- Implement and maintain CI/CD pipelines for data and AI assets
- Promote infrastructure-as-code practices (Terraform, Bicep, or equivalent) for repeatable, governed environments
- Define environment promotion paths (dev, test, prod) with embedded governance and policy checks
Requirements:
- Bachelor's degree AND 5+ years of experience in data engineering or data platform engineering in a production environment
- SQL and hands-on experience with Python and PySpark
- Experience in data engineering on at least one major cloud or data platform (Azure, AWS, Databricks, or equivalent), with skills that are transferable across platforms
- Experience with Git-based development and CI/CD practices
- Experience with data governance concepts such as cataloging, lineage, data quality, and access control
- Experience with engineering judgment and balancing speed, safety, and governance
- Hands-on experience with Microsoft Purview or comparable data governance platforms
- Experience building AI agents, LLM-based automation, or intelligent assistants for operational or platform workflows
- Experience embedding governance controls into DevOps workflows (policy-as-code mindset)
- Familiarity with Microsoft Fabric or Databricks-centric architectures, and the ability to learn new platforms quickly