Scorpion is the leading provider of technology and services helping local businesses thrive. The Senior Data Engineer plays a critical role in building and evolving the trusted data foundation that powers Scorpion's AI products, tools, and decision-making.
Responsibilities:
- Evaluate and improve data quality, completeness, and consistency across 30+ databases, applications, platforms, and APIs
- Design and build Scorpion's analytical data platform, creating a trusted source of truth for business and client data
- Develop and maintain scalable data pipelines that efficiently move data from operational systems into the analytical platform
- Help teams transition from querying production databases directly to using trusted analytical data sources
- Design data access patterns that enable AI agents and applications to quickly retrieve relevant client and business information
- Build secure, scalable solutions that deliver client context and business insights to internal applications and AI-powered experiences
- Define, monitor, and improve service level agreements (SLAs) for data freshness, availability, reliability, and performance
- Partner with engineering, product, and data science teams to establish data standards, governance practices, and data contracts
- Continuously improve the scalability, performance, and reliability of Scorpion's data ecosystem
Requirements:
- Bachelor's degree in Computer Science, Data Engineering, Information Systems, Software Engineering, or a related technical field, or equivalent practical experience
- 7+ years of data engineering experience, including designing and operating production-scale analytical data platforms
- Experience building scalable data pipelines, analytical systems, and data platforms in modern cloud environments
- Experience integrating and unifying data from multiple systems into trusted, business-ready analytical platforms
- Experience supporting data access and consumption patterns for applications, analytics, machine learning, or AI-powered solutions
- Experience designing scalable data models that support analytics, reporting, and AI-driven applications
- Experience establishing data governance standards, data contracts, and documentation practices across teams
- Deep expertise with Databricks, Delta Lake, and Apache Spark for large-scale data processing, streaming ingestion, and data transformation
- Experience working with Snowflake, BigQuery, ClickHouse, Azure Synapse, or similar analytical database technologies
- Advanced SQL skills, including query optimization, execution planning, and performance tuning
- Strong Python skills for data pipeline development, automation, transformation, and integration with production systems
- Experience building and managing data pipelines using dbt, Airflow, Prefect, or similar orchestration and transformation tools
- Strong understanding of lakehouse architecture, data partitioning strategies, metadata management, and data security best practices
- Ability to collaborate effectively across engineering, product, data science, and business teams
- Strong communication skills with the ability to translate business requirements into scalable data solutions
- Strong analytical and problem-solving skills with a focus on building scalable, maintainable systems
- Ability to balance long-term platform strategy with near-term business priorities