Cylinder delivers personalized, clinician-backed care for digestive health issues through its virtual health platform. The company is seeking a highly experienced Senior Data Engineer to lead data infrastructure initiatives, mentor the engineering team, and architect scalable data solutions that improve patient outcomes.
Responsibilities:
- Architect and design scalable, fault-tolerant data pipelines for complex healthcare data ingestion, transformation, and reporting
- Lead technical decision-making for data infrastructure and tooling selection
- Mentor junior engineers and establish engineering best practices across the team
- Drive cross-functional collaboration with Data Science, Business Intelligence, and Product teams
- Champion data quality initiatives and establish robust monitoring and alerting systems
- Design and implement solutions for complex distributed system challenges
- Evaluate and implement emerging technologies to maintain our competitive edge
- Design and implement enterprise-grade data architectures with emphasis on scalability, reliability, and maintainability
- Lead code reviews and establish coding standards that promote excellence across the engineering organization
- Drive strategic partnerships with product stakeholders, translating complex business requirements into elegant technical solutions
- Own end-to-end delivery of critical data infrastructure projects
- Establish comprehensive testing strategies, documentation standards, and operational procedures
- Continuously evaluate and optimize system performance, cost efficiency, and reliability
- Serve as subject matter expert for healthcare data compliance and security requirements
Requirements:
- Bachelor's degree in Computer Science, Engineering, or related field, or equivalent professional experience
- 5+ years of demonstrated experience architecting and building production data pipelines and MLOps environments
- Expert-level proficiency in SQL and Python with demonstrated experience in performance optimization
- Deep expertise with modern data stack technologies including dbt, Airflow, BigQuery, and cloud-native solutions
- Extensive experience with major cloud platforms (AWS, GCP, or Azure) including infrastructure-as-code practices
- Proven track record working with healthcare, medical, or population health data
- Outstanding communication and leadership skills with experience mentoring engineering teams
- Experience establishing data governance and compliance frameworks in regulated industries
- Technical Leadership: You have a proven track record of architecting complex data systems and leading technical initiatives in fast-paced environments
- Healthcare Impact: You're passionate about leveraging data to improve patient outcomes and understand the unique challenges of healthcare data
- Security Excellence: You have deep expertise in healthcare data security, HIPAA compliance, and implementing robust data governance frameworks
- Collaboration & Mentorship: You excel at leading cross-functional teams and developing junior talent
- Innovation Mindset: You stay current with emerging technologies and can evaluate their strategic fit for our organization
Preferred qualifications:
- Advanced degree in Computer Science, Data Engineering, or related field
- Extensive experience with Google Cloud Platform, including BigQuery, Vertex AI, IAM, Pub/Sub, Dataflow, and Cloud Composer
- Expert-level experience with Terraform and infrastructure automation
- Experience with real-time streaming data architectures
- Track record of successfully scaling data systems in high-growth startup environments
- Published thought leadership or contributions to open-source data engineering projects