Lorven Technologies Inc. is seeking a Lead Data Engineer with extensive experience in data engineering and cloud solutions. The role involves designing and optimizing data pipelines, managing AWS data solutions, and integrating workflows to support AI/ML initiatives.
Responsibilities:
- Design, develop, and optimize data pipelines using Python and PySpark for batch and incremental processing
- Build and manage AWS based data solutions leveraging services such as S3, Glue, and cloud native processing frameworks
- Prepare, transform, and curate datasets to support AI/ML and GenAI model development
- Integrate data pipelines with AI/ML workflows, ensuring data quality, consistency, and traceability
- Implement data validation, profiling, and performance tuning to improve reliability and scalability
- Collaborate with data scientists, ML engineers, and platform teams to deliver end to end GenAI solutions
Requirements:
- Strong hands on experience with Python for data engineering and automation
- Proven expertise in PySpark / Spark for large scale data processing
- Experience working in AWS cloud environments for data engineering workloads
- Solid understanding of data engineering fundamentals, including ETL, data modeling, and performance optimization
- Experience supporting or working alongside AI/ML or GenAI initiatives
- Exposure to GenAI pipelines, model data preparation, or LLM driven workflows
- Experience with CI/CD, data quality frameworks, or cloud cost optimization
- Familiarity with SQL based analytics and metadata driven data processing