EXL is a global analytics and digital solutions company that enhances business outcomes through data management and innovative solutions. The role of Big Data Analytics & Machine Learning Engineer involves designing and maintaining scalable data pipelines, collaborating with data scientists to implement machine learning models, and optimizing data management processes for enterprise datasets.

Responsibilities:

Design, build, and maintain scalable data pipelines to process and transform large-scale enterprise datasets
Collaborate closely with Data Scientists to operationalize basic ML models, ensuring they perform efficiently in a distributed computing environment
Utilize Hive for data warehousing and management, ensuring high performance for complex analytical queries
Write complex, optimized SQL and PySpark code to extract insights and prepare features for modeling
Optimize Spark jobs and SQL queries to handle enterprise scale data sets with minimal latency
Serve as the technical bridge between data engineering infrastructure and the data science lifecycle

Requirements:

5–8 years of professional experience in Big Data Engineering or ML Engineering
Expert-level proficiency in PySpark
Strong SQL skills for deep-dive analysis and extensive experience working with Hive databases
Advanced Python skills, specifically for data science libraries (Pandas, Scikit-learn, NumPy)
Previous experience working as an ML Engineer, including model deployment, monitoring, and feature engineering
Proven track record of handling 'messy' enterprise data at scale
Familiarity with CI/CD pipelines for ML (MLOps)
Previous experience in Banking/Capital Markets/Financial Services is required

Big Data Analytics & Machine Learning Engineer

Key skills

About this role

Responsibilities:

Requirements: