7Seventy Recruiting is seeking a Senior DevOps Engineer to design, build, and operate reliable, scalable cloud infrastructure. The role involves leading platform engineering initiatives, collaborating with application and security teams, and enhancing performance and operational efficiency across cloud environments.
Responsibilities:
- Lead platform engineering initiatives using Kubernetes (EKS), Helm, and Infrastructure as Code
- Design and operate CI/CD platforms and deployment strategies to enable safe, low-risk releases
- Build and maintain observability foundations, including metrics, logging, alerting, and dashboards aligned with service health
- Design, build, and operate secure and scalable AWS infrastructure, including VPCs, subnets, routing, NAT, VPNs, and Transit Gateway
- Own and evolve cloud networking and connectivity architecture to ensure secure and performant service communication
- Configure and manage Cloudflare for edge security, including WAF, DDoS protection, rate limiting, DNS, and traffic management
- Participate in incident response, root cause analysis, and blameless post-mortems to implement durable corrective actions
- Strengthen infrastructure security and compliance, including IAM, network controls, container security, and SOC 2 alignment
- Define and improve service reliability using SLIs, SLOs, and error budgets
- Reduce operational toil through automation and standardized platform patterns
- Drive cloud and network cost optimization initiatives
- Mentor engineers and contribute to a collaborative, high-performance engineering culture
Requirements:
- 6+ years of experience in DevOps, Platform Engineering, or SRE roles
- Strong hands-on experience with AWS, including networking and connectivity design
- Deep experience with Kubernetes (EKS)
- Experience with Infrastructure as Code tools such as AWS CDK, Terraform, or similar
- Experience designing and maintaining CI/CD pipelines (GitHub Actions preferred)
- Hands-on experience configuring and operating Cloudflare (CDN, WAF, DNS, edge security)
- Proficiency in scripting languages such as Python and Bash for automation
- Experience owning production systems with high availability, performance, and security requirements
- Strong understanding of cloud networking fundamentals, including routing, load balancing, security groups, and NACLs
- Solid knowledge of SRE principles and operational excellence practices
- Strong communication skills and ability to collaborate across technical teams
- Experience improving observability platforms such as Prometheus, Grafana, or OpenTelemetry
- Prior experience mentoring engineers or leading cross-team initiatives
- Consistent, reliable high-speed internet access and a dedicated, distraction-free workspace
- Experience with advanced AWS networking, including multi-account architectures and private connectivity
- Experience building or operating internal developer platforms
- Familiarity with SOC 2 or regulated environments
- Exposure to AI-assisted operations or intelligent automation