CitiusTech is a leading healthcare technology company that focuses on digital innovation and business transformation. They are seeking a Senior Data Engineer skilled in Spark SQL, PySpark, AWS/Azure Databricks, and Python to build scalable data processing pipelines and enhance data ingestion processes.
Responsibilities:
- Create scalable and high-performance data processing and ETL pipelines
- Work in a fast-paced, creative atmosphere to develop new ideas that adapt to evolving user needs
- Build complex queries and perform data analysis
Requirements:
- 10+ years of experience
- Engineering Degree – BE/ME/BTech/MTech/BSc/MSc
- Spark SQL
- Spark (PySpark)
- AWS Databricks
- Python
- ETL
- Data Warehousing
- Shell scripting
- Tableau
- Strong experience with AWS/Azure Databricks
- SQL
- Knowledge of ETL concepts
- Data Ingestion
- Shell – DOS, Bash scripting
- Pre-processing of data using Python and PySpark
- Proven ability to build complex queries and perform data analysis
- Experience creating scalable and high-performance data processing and ETL pipelines
- Strong visual and verbal communication skills
- Technical certification in multiple technologies is desirable
- Knowledge of building ETL processing pipelines using AWS/Azure Databricks is an added advantage
- Exposure to a cloud platform (AWS or Azure) is an added advantage