Deckers Brands is committed to creating an inclusive workplace where employees can be their authentic selves. The Platform Engineering Lead will be responsible for the reliability, security, and scalability of the company's cloud infrastructure, focusing on AWS platform delivery and operational support.
Responsibilities:
- Support and maintain AWS cloud infrastructure across EC2, S3, Redshift, IAM, VPC, subnets, and networking components
- Troubleshoot and debug infrastructure as code written in CloudFormation and Terraform, correcting and enhancing configurations as needed
- Manage DevOps operations, including CI/CD pipeline automation and deployment troubleshooting
- Implement and uphold security best practices, including identity and access management, least privilege, and secure configuration standards
- Monitor and optimize cloud resource utilization, cost, and performance, recommending improvements for reliability and efficiency
- Take ownership of infrastructure tasks, driving them to completion with clear communication of status and risks
- Proactively resolve infrastructure issues, including incident response, root cause analysis, and preventive actions
- Document infrastructure configurations, operational procedures, and runbooks to boost team effectiveness and reduce risk
- Provide occasional support for GCP-based infrastructure and services
- Collaborate with engineering, security, and IT partners to deliver stable platform capabilities and enhance developer experience
Requirements:
- Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent practical experience
- 5+ years of hands-on AWS cloud infrastructure experience
- 2+ years in a lead or senior engineering role with demonstrated ownership of infrastructure deliverables
- Hands-on experience with AWS services including Redshift, S3, IAM, VPC, subnets, CloudFormation, and EC2
- Strong knowledge of Terraform, with the ability to read, understand, and debug existing code
- Demonstrated ability to take ownership and deliver results independently, backed by strong analytical skills
- Ability to work effectively with cross-functional teams
- Strong troubleshooting and problem-solving skills across cloud infrastructure, networking, IAM, and CI/CD systems
- Ability to balance operational support with proactive improvements and automation
- Clear written and verbal communication skills, including documentation and runbook creation
- Strong ownership mindset and accountability for production stability and delivery commitments
- Ability to prioritize effectively and manage multiple concurrent infrastructure tasks
- Relevant AWS certifications such as Solutions Architect or SysOps Administrator are a plus
- Basic knowledge of GitLab and version control systems
- Basic familiarity with Google Cloud Platform (GCP) services
- Experience developing new Terraform modules and configurations
- Experience with Boto3 (AWS SDK for Python) and Python scripting/automation
- Proficiency with AWS CLI