Apetan Consulting LLC is seeking an experienced Senior Lead Data Engineer to design, develop, and manage scalable data platforms and pipelines that support analytics, reporting, and business operations. The ideal candidate will lead data engineering initiatives, mentor team members, and collaborate with cross-functional teams to build reliable and high-performance data solutions.
Responsibilities:
- Design, develop, and maintain scalable data pipelines and ETL/ELT processes
- Build and optimize data architectures, data lakes, and warehouse solutions
- Lead end-to-end data engineering projects and ensure timely delivery
- Develop robust data models for analytics and reporting requirements
- Work with large-scale structured and unstructured datasets
- Enforce data quality, integrity, governance, and security standards
- Optimize database and pipeline performance for high-volume processing
- Collaborate with data analysts, data scientists, DevOps, and business stakeholders
- Mentor junior engineers and provide technical leadership to the team
- Implement automation, monitoring, and CI/CD practices for data workflows
- Evaluate and recommend new tools, technologies, and best practices
Requirements:
- Strong expertise in SQL, Python, and data engineering concepts
- Hands-on experience with ETL/ELT frameworks and workflow orchestration tools
- Experience with cloud platforms such as Amazon Web Services, Microsoft Azure, or Google Cloud
- Knowledge of big data technologies such as Spark, Hadoop, or Kafka
- Experience with data warehousing solutions like Snowflake, Redshift, or BigQuery
- Strong understanding of database design, performance tuning, and optimization
- Familiarity with CI/CD pipelines and version control tools like Git
- Excellent problem-solving, leadership, and communication skills
- Bachelor's or Master's degree in Computer Science, Engineering, Information Technology, or a related field
- Experience leading engineering teams or enterprise-scale projects
- Cloud or data engineering certifications are a plus
- Exposure to machine learning pipelines or real-time streaming architectures is an advantage