EXL is a leading data analytics and digital operations company that partners with clients to reinvent business models using data and AI. They are seeking an experienced Senior AWS Cloud Engineer to lead the architecture, development, and optimization of a next-generation data platform, while mentoring team members and delivering impactful solutions in a fast-paced environment.
Responsibilities:
- Own the deployment and life cycle management of dedicated data environments (e.g., development, staging, production) to ensure separation, security, and scalability for our Data Engineering teams
- Architect and configure high-throughput, low-latency networking (VPC peering, PrivateLink, Transit Gateway) to securely connect our core data compute engines (Snowflake, Redshift, Athena) and data stores (S3 Data Lakes) with our application and corporate environments
- Collaborate with DataOps teams to integrate Data Environment provisioning into CI/CD pipelines to ensure that all data infrastructure (e.g., Glue Catalog, EC2 for Spark/dbt runners, ECS clusters) is deployed consistently and securely
- Ensure all data-intensive services and their underlying compute/storage components are designed with disaster recovery and high-availability best practices
- Lead the strategic planning, design, and implementation of complex AWS networking solutions, including highly available and fault-tolerant VPC architectures, Subnets, Route Tables, and Network Access Control Lists (NACLs)
- Architect and manage secure, high-speed connectivity solutions between AWS and on-premises environments, utilizing technologies such as AWS Direct Connect, VPN, and Transit Gateway
- Design and optimize the deployment of compute services, including sophisticated configurations of EC2 instances and container orchestration using ECS (Elastic Container Service)
- Implement and enforce security best practices across all infrastructure, focusing on Identity and Access Management (IAM), Security Groups, and integrating services like AWS WAF and Shield
- Lead the development and maintenance of our cloud foundation using IaC tools to ensure immutable, repeatable, and scalable deployments
- Drive continuous optimization of AWS resource consumption and costs, utilizing RIs, Savings Plans, and automated scaling policies across all environments, including the data platform
- Automate routine operational tasks, infrastructure health checks, and security remediation using AWS Lambda, Systems Manager, and scripting languages (e.g., Python, Bash)
- Act as the top-tier technical expert for major infrastructure incidents, quickly diagnosing and resolving complex networking and platform issues to restore service availability
Requirements:
- 10+ years of progressive experience in designing, implementing, and managing large-scale cloud infrastructure, with a deep specialization in AWS
- Expert-level knowledge of AWS networking fundamentals, including TCP/IP, DNS, Routing, Load Balancing (ALB/NLB), and IP addressing schemas
- Proven experience with containerization and orchestration, specifically AWS ECS. Experience with EKS is a plus
- Deep practical experience with CI/CD and DevOps practices, focusing on infrastructure automation
- AWS Certification at a Professional level (e.g., AWS Certified Solutions Architect - Professional or AWS Certified Advanced Networking - Specialty) is highly desirable
- Collaboration: Excellent communication skills, capable of working effectively with Data Engineers, Data Scientists, and other teams
- Insurance industry experience preferred but not mandatory