Sourcebooks is an innovative publishing powerhouse that believes in the transformative power of books and the necessity of an entrepreneurial mindset. They are seeking a contract data engineer to help scale their data platform, focusing on building and extending infrastructure for informed decision-making.
Responsibilities:
- Expand silver and gold table coverage by converting raw data into business-ready tables, incorporating business logic from analytics stakeholders
- Maintain and debug pipelines in Microsoft Fabric and Azure Data Factory
- Validate incremental load logic
- Investigate data issues
Requirements:
- 3-5 years of experience in analytics or ideally data engineering
- Strong SQL skills -- multi-CTE queries, and comfortable debugging code a team member wrote
- Hands-on experience with a cloud data platform (Fabric, Databricks, or similar), and Python/PySpark as a secondary skill, enough to read and modify existing notebooks
- Have worked within established systems, follow Spark SQL conventions, and communicate blockers clearly
- Experience with Microsoft Fabric
- Familiar with Delta Lake and medallion architecture
- Experience with financial transaction data (SAP, NetSuite)
- Azure CLI or REST API experience for platform-level queries
- Exposure to agentic AI workflows or AI-assisted development for debugging