Solugenix is assisting a client in their search for a Lead DevOps & Cloud Engineer. This role involves owning and evolving cloud infrastructure, deployment pipelines, and platform reliability, with a focus on AWS-centric architecture and automated CI/CD practices.
Responsibilities:
- Design, provision, and maintain scalable AWS infrastructure using IaC tools (Terraform / CloudFormation)
- Manage and optimize ECS Fargate workloads; lead the migration path to EKS as the platform matures
- Administer MySQL RDS instances: performance tuning, patching, backup/restore, and high-availability configuration
- Architect and manage supporting AWS services: VPC, IAM, ALB/NLB, S3, CloudWatch, Secrets Manager, ECR, Route 53, and more
- Define and enforce cloud cost optimization strategies and tagging standards
- Build and maintain end-to-end CI/CD pipelines in Bitbucket Pipelines integrated with the Atlassian ALM ecosystem (Jira, Confluence, Bitbucket)
- Design and implement an on-demand environment provisioning system that allows developers and QA to spin up isolated, production-like environments on demand accelerating feature development, testing, and release cycles
- Environments should be branch-scoped or PR-scoped, created and torn down automatically
- Integration with Jira tickets for traceability from issue to live preview environment
- Implement blue/green and canary deployment strategies on ECS Fargate; extend patterns to EKS
- Maintain reusable pipeline templates and enforce standardized build/test/deploy workflows across all repositories
- Establish SLOs, implement centralized logging (CloudWatch / OpenSearch), distributed tracing, and alerting
- Lead incident response for infrastructure-layer events; conduct blameless post-mortems and drive remediation
- Enforce cloud security best practices: least-privilege IAM, network segmentation, secrets management, and vulnerability scanning in CI
- Manage disaster recovery planning, RTO/RPO targets, and regular DR drills
- Serve as the internal platform owner making it easy for application developers to self-serve infrastructure within guardrails
- Collaborate with engineering teams to reduce deployment friction and improve local-to-production parity
- Document infrastructure patterns, runbooks, and architecture decisions in Confluence
- Mentor engineers on DevOps and cloud-native best practices
Requirements:
- 6+ years of DevOps / platform / cloud engineering experience, with at least 2 years in a lead or senior capacity
- Deep, hands-on AWS expertise: ECS (Fargate), RDS (MySQL), VPC, IAM, ALB, CloudWatch, S3, ECR, Secrets Manager
- Proficiency in infrastructure-as-code with Terraform (preferred) or CloudFormation
- Proven experience building CI/CD pipelines with Bitbucket Pipelines; familiarity with the broader Atlassian ALM suite (Jira, Confluence)
- Demonstrated experience building or operating on-demand / ephemeral environment systems at scale
- Strong scripting skills: Bash, Python, or similar
- Experience with containerization: Docker, ECS task definition management, container security
- Solid networking fundamentals: DNS, TLS, VPC routing, security groups, load balancing
- Strong written and verbal communication skills; ability to document and present architecture decisions clearly
- Kubernetes administration experience (EKS preferred) and knowledge of Helm, Kustomize, or GitOps tooling (ArgoCD / Flux)
- Experience with AWS Service Catalog, Control Tower, or AWS Organizations for multi-account governance
- Familiarity with database migration tooling (Flyway) and RDS Proxy
- Knowledge of FinOps practices and AWS Cost Explorer / Savings Plans optimization
- AWS certifications: Solutions Architect, DevOps Engineer - Professional
- Experience with feature-flag systems, progressive delivery, or trunk-based development at scale