Act as the primary technical point of contact for Engineering and Product teams, providing daily operational support for Platform requests
Design, deploy, and manage scalable services on GCP
Build and optimize repeatable deployment pipelines and deployment blueprints
Monitor cloud expenditure and implement right-sizing initiatives to support departmental cost-reduction goals
Maintain comprehensive monitoring and alerting frameworks to ensure high availability and lead rapid incident response for critical systems
Execute stack-hardening tasks, including vulnerability scanning, access control audits, and secrets detection
Manage the ongoing updates, patching, and maintenance of platform components
Requirements
5–7 years in DevOps, SRE, or Cloud Engineering with a focus on high-availability production environments
3+ years of deep, hands-on experience with major cloud providers (GCP preferred, AWS, or Azure), including cloud architecture, resiliency, and disaster recovery
Expert-level proficiency with Terraform and a 'policy as code' mindset
Advanced knowledge of Docker and Kubernetes, including image management and cluster administration
Strong fundamentals in Python, Shell, or equivalent languages for automation and tool building
Proven experience building and maintaining modern pipelines (GitHub Actions, JFrog, CloudBuild)
Solid understanding of cloud networking (VPCs, Firewalls) and security practices, including IAM, secrets management, and vulnerability scanning
Tech Stack
AWS
Azure
Cloud
Docker
Firewalls
Google Cloud Platform
Kubernetes
Python
Terraform
Benefits
Generous Flexible Time Away policy
Fully remote company, with collaborative asynchronous teamwork