Design and implement infrastructure across multiple cloud platforms (AWS, GCP, OCI) and CrowdStrike managed datacenters
Develop cloud expertise across multiple platforms, becoming proficient in cloud-native services and best practices for each provider
Create standardized patterns and frameworks for deploying services across different cloud providers
Monitor and maintain the health, performance, and reliability of multi-cloud infrastructure processing trillions of events daily
Implement comprehensive monitoring with Service Level Indicators across different cloud platforms to enable proactive alerting
Conduct capacity planning and forecasting based on workload patterns across cloud environments
Provision and scale infrastructure (vertical/horizontal) based on demand and cloud-specific capabilities
Develop automation tools and microservices for platform components across multiple cloud platforms
Orchestrate version upgrades, patch management, and configuration changes with minimal customer impact
Establish security and compliance controls tailored to each cloud platform's requirements
Collaborate with product management, engineering, and customer support teams to align infrastructure capabilities with product vision and customer needs
Act as a technical liaison between FrontTier Expansions and internal business partners to ensure infrastructure supports business objectives
Contribute to delivering world-class performance and reliability for customers
Build and maintain CI/CD pipelines for testing and releasing configuration and software across clouds
Deploy, operate, and troubleshoot applications in Kubernetes clusters across managed and self-managed environments
Participate in on-call rotation to support production infrastructure globally
Requirements
5+ years of experience with large-scale, business-critical Linux-based environments
Hands-on experience with at least one major cloud provider (AWS, GCP, or OCI) with eagerness to learn others
Experience with configuration management and Infrastructure as Code using Chef, Ansible, or similar tools
Experience deploying, operating, and troubleshooting applications in Kubernetes clusters in production environments
Proficiency in scripting and programming using Python, Bash, or Go
Experience working with CI/CD tools (Jenkins, Git, Artifactory, Bitbucket)
Understanding of distributed systems and data processing at scale
Knowledge of networking concepts including VPCs, load balancers, and DNS
Proven ability to work effectively with remote teams across multiple time zones
Strong analytical skills with ability to make data-driven infrastructure decisions
Excellent communication skills for documenting solutions and collaborating with distributed teams
Curiosity about and ability to investigate new cloud technologies, systems, and tools
A can-do attitude; you thrive collaborating in a team and are not afraid of taking on responsibilities
Availability for on-call on a rotational basis
Tech Stack
Ansible
AWS
Chef
Cloud
Distributed Systems
DNS
Google Cloud Platform
Jenkins
Kubernetes
Linux
Microservices
Python
Go
Benefits
Market leader in compensation and equity awards
Comprehensive physical and mental wellness programs
Competitive vacation and holidays for recharge
Paid parental and adoption leaves
Professional development opportunities for all employees regardless of level or role
Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections