Storable is on a mission to power the future of storage with an innovative platform that helps businesses manage their self-storage operations. They are seeking a Senior Data Engineer to shape data operations, enhance data quality, and collaborate with cross-functional teams to drive informed decision-making.
Responsibilities:
- Oversee Data Pipelines: Design, implement, and maintain scalable data pipelines using industry-standard tools to efficiently process and manage large-scale datasets
- Ensure Data Quality & Governance: Implement data governance policies and frameworks to ensure data accuracy, consistency, and compliance across the organization
- ETL Development: Build, optimize, and maintain ETL pipelines for ingesting, transforming, and delivering large datasets from multiple sources
- Workflow Orchestration: Manage and schedule complex workflows using Apache Airflow
- Query Engines & Processing Frameworks: Leverage Trino (Presto), Apache Spark, and other
- Manage Cross-Functional Collaboration: Partner with engineering, product, and business teams to make data accessible and actionable, and ensure it drives informed decision-making
- Optimize Data Infrastructure: Leverage modern data tools and platforms (e.g., AWS, Apache Airflow, Apache Iceberg) to create an efficient, reliable, and scalable data infrastructure
- Monitor & Improve Performance: Proactively monitor data processes and workflows, troubleshoot issues, and optimize performance to ensure high reliability and data integrity
Requirements:
- 6+ years of significant experience in managing data infrastructure, data governance, and optimizing data pipelines at scale
- 5+ years of strong hands-on experience with data tools and platforms such as Apache Airflow, Apache Iceberg, and AWS services (S3, Lambda, Redshift, Glue, Athena)
- Familiarity with designing, implementing, and optimizing data pipelines and workflows in Python or other languages for data processing
- 5+ years of hands-on experience with Trino/Presto and Apache Spark for distributed data processing
- Solid understanding of data modeling, warehousing concepts, and schema design
- Solid understanding of data privacy, quality control, and governance best practices
- Ability to lead and mentor teams, influence stakeholders, and drive data initiatives across the organization
- Strong problem-solving abilities and a data-driven approach to improve business operations
- Ability to communicate complex data concepts to both technical and non-technical stakeholders effectively
- Experience with visualization tools (e.g., Looker, Tableau) and reporting frameworks to provide actionable insights