Innoventrics is seeking a skilled Data Engineer with expertise in Google Cloud Platform (GCP), AI/ML, and Large Language Models (LLMs). The ideal candidate will design, build, and optimize scalable data pipelines and AI-driven solutions, working closely with data scientists and business stakeholders.
Responsibilities:
- Design, develop, and maintain scalable data pipelines on GCP
- Build and optimize ETL/ELT workflows using tools like Cloud Dataflow, Dataproc, and BigQuery
- Collaborate with AI/ML teams to deploy and support machine learning models in production
- Integrate and manage LLM-based applications (e.g., prompt engineering, fine-tuning, RAG pipelines)
- Develop data architectures supporting real-time and batch processing
- Ensure data quality, governance, and security best practices
- Optimize performance and cost of cloud-based data systems
- Work with APIs and external data sources for ingestion and processing
- Implement monitoring, logging, and alerting for data workflows