Collaborate with operations team members and Principal and Staff Engineers to understand desired system level outcomes and operational requirements.
Manage scalable cloud network topologies, including transit gateways, VPCs, subnets, security groups, and route tables.
Design and maintain automated delivery pipelines (Infrastructure as Code).
Provide technical input and guidance regarding responsible components during planning and design sessions with Principal and Staff Engineers.
Provide hands-on technical contributions to the infrastructure design and deployment with approximately 80% on solving technical problems and 20% mentoring and coordinating.
Act as the second escalation point for complex incidents, leading thorough root cause analysis (RCA) and implementing systemic, long-term fixes.
Requirements
5+ years in cloud operations, or DevOps, with focus on production workloads in mission-critical environments.
Advanced proficiency in AWS (EC2, VPC, IAM), CloudFormation or Terraform, and CI/CD tools such as GitHub Actions.
Deep understanding of VPC design, subnetting, Route53, PrivateLink and related AWS technologies; experience troubleshooting complex connectivity issues across multi-account or hybrid on-prem/cloud environments.
Demonstrated ability to take ownership of a problem and deliver a production-ready solution.
Proficient in one or more software languages such as Node.js or Python.
Strong experience with observability stacks (e.g., Splunk, DataDog, or CloudWatch) including logging, tracing, and metrics tuning.
Travel quarterly to Product Increment planning.
Must be a U.S. Citizen and able to obtain a DoD NIPR network account and Common Access Card (CAC)
Must have, or be able to obtain, a Secret Clearance.
Tech Stack
AWS
Cloud
EC2
JavaScript
Node.js
Python
Splunk
Terraform
Benefits
Equity
Medical, Life, Short-Term Disability, and AD&D insurance