Genesis10 is seeking a Data Engineer for a 6 month + contract position with an investment management firm. In this role, you'll contribute to developing efficient data pipelines for trading and risk management while modernizing legacy systems with cloud-based tools.

Responsibilities:

Design, develop, and optimize data pipelines for trading, alpha generation, research, risk management, accounting, and more
Build new golden source datasets such as security master, account master, and price master which are critical to the firm
Develop shared Python libraries for data APIs, logging, and other core functionalities
Ensure high data quality and observability using modern data governance tools
Optimize large-scale data processing workflows for efficiency and performance
Collaborate with technical and non-technical teams to understand data requirements and implement effective solutions
Partner with product managers and business stakeholders to understand data needs and translate them into scalable, well-documented data products
Support and troubleshoot data pipelines, APIs, and database performance issues

Requirements:

5+ years of development experience with 2+ years focused on data engineering
Bachelor's degree in computer science or related field
Great communication skills and capability for cross-functional collaboration
Excellent Python and SQL skills for data processing and automation
Extensive ETL/ELT pipeline experience and expertise
Strong understanding of data structures, data modeling, efficient query design, and performance tuning in a SQL database such as Postgres or MS SQL Server
DBT for data transformation
Experience building and deploying containerized applications (Docker, Kubernetes) in cloud environments
Hands-on, production-level experience with Snowflake for cloud data warehousing — this is a core, non-negotiable requirement. Comfort working across multiple warehouse and lakehouse tools is a strong plus
Proficiency with Apache Spark for large-scale and streaming distributed data processing
Experience with Apache Kafka for streaming data ingestion and pipeline development
Excellent problem-solving skills
Experience with Databricks for unified analytics and data engineering workflows
Hands-on experience with Apache Spark on AWS EMR for streaming and large-scale data processing use cases
Experience with Apache Iceberg for open table format and large-scale, mutable dataset management in a lakehouse architecture
Familiarity with data quality and observability tooling (DataFold, Great Expectations, DBT tests) and data catalog/governance platforms (OpenMetadata), including lineage integration
Financial market data literacy with product knowledge spanning equities, fixed income, futures, and options
Experience designing dashboards in a Business Intelligence tool
Skill with a Python web API framework such as FastAPI

Data Engineer

Key skills

About this role

Responsibilities:

Requirements: