First American is a company that values its people and fosters an inclusive environment. They are seeking a Platform Engineer II to evolve and operate their multi-cloud backup platform, focusing on infrastructure engineering, operational reliability, and automation development.
Responsibilities:
- Design, implement, and maintain cloud infrastructure supporting enterprise backup services across AWS and GCP
- Define and manage infrastructure using Infrastructure-as-Code (Terraform preferred)
- Translate engineering requirements into secure, scalable cloud architecture solutions
- Modify and enhance existing platform components to improve resiliency, performance, and maintainability
- Build and improve CI/CD pipelines supporting infrastructure and platform deployments
- Contribute to IAM design, least-privilege access models, and secure cloud architecture patterns
- Develop detailed technical specifications and documentation for infrastructure implementations
- Partner with Security, Infrastructure, and Application teams to ensure successful platform integrations
- Monitor and maintain production backup jobs, replication workflows, and recovery processes
- Troubleshoot backup failures and restore issues across AWS and GCP environments
- Perform platform maintenance, installations, upgrades, and lifecycle management activities
- Participate in incident response, root cause analysis, and corrective action planning
- Support disaster recovery testing and validation efforts to ensure alignment with RPO/RTO objectives
- Improve operational runbooks and documentation to enhance reliability and efficiency
- Participate in on-call support as required by business needs
- Leverage AI-assisted development tools to reduce operational toil, improve code quality, and automate repetitive engineering tasks
- Develop automation scripts and tooling (Python, Bash, or similar) to reduce manual operational effort
- Build and enhance pipeline automation to ensure the quality and reliability of infrastructure changes
- Create internal tools used by Platform Engineering and partner teams
- Contribute to automated validation and regression testing for platform updates
- Document technical designs and implementation details to support knowledge sharing
Requirements:
- 2–5 years of directly related experience in cloud infrastructure, platform engineering, or DevOps
- Experience defining infrastructure and platform capabilities using Infrastructure-as-Code and automation technologies
- Hands-on experience with AWS required; familiarity with GCP strongly preferred
- Experience working with CI/CD pipelines and build automation tools
- Practical experience leveraging AI-assisted development or automation tools to improve engineering productivity and reduce manual effort
- Experience troubleshooting production cloud systems
- Bachelor's degree in Computer Science, Information Technology, or related field or equivalent combination of education, certifications, and experience
- Proficiency in at least one scripting language (Python, Bash, or similar)
- Familiarity with multiple IaC tools (Terraform preferred; CloudFormation acceptable)
- Experience with CI/CD pipelines, automation, and build tools
- Working knowledge of cloud networking fundamentals (VPCs, subnets, routing, security groups/firewalls)
- Familiarity with AWS and GCP core services (IAM, object storage, logging/monitoring, compute)
- Familiarity with LLM-assisted coding tools and models (e.g., Cursor/Copilot, and Claude/GPT) and their secure use in dev/CI workflows
- Understanding of backup and disaster recovery concepts (RPO, RTO, retention policies, immutability)
- Experience with monitoring and logging systems (CloudWatch, Cloud Logging, or similar)
- Strong written and verbal communication skills