Empiric is seeking an experienced Data Engineer to join our team and play a key role in designing, building, and maintaining scalable data pipelines. In this role, you will transform complex, real-world data into analytics-ready datasets and help enable meaningful insights through downstream analytics and reporting.
Responsibilities:
- Design, develop, and maintain relational databases and cloud-based data warehouses
- Develop sophisticated, optimized SQL queries, stored procedures, and functions to process and analyze large, complex datasets for actionable business insights
- Help build, automate, and orchestrate ETL/ELT workflows utilizing SQL, Python, and cloud-native tools to integrate and transform data from diverse, distributed sources
- Tune queries and optimize database schema (indexing, partitioning, normalization) to improve data retrieval and processing speeds
- Ensure data quality, consistency, and integrity across systems. Implement data masking, encryption, and role-based access control (RBAC)
- Maintain technical documentation for database schemas, data dictionaries, and ETL workflows
Requirements:
- Bachelor's degree in computer science, Information Systems, or a related field
- 5+ years of experience with advanced SQL
- Deep understanding of relational database management systems and data modeling techniques
- Demonstrated experience with Microsoft Fabric and other data warehouse technologies
- Proficiency in Python for scripting and data manipulation
- Familiarity with tools like SSIS and Microsoft Fabric