We help client teams use DevOps practices to create more observable, sustainable, and predictable environments by integrating operations capabilities into development teams.
Delivering primary DevOps solutions to clients across:
Cloud Architecture and Deployment in at least one major cloud provider
Hands-on experience running production services on Kubernetes and managed container platforms, including rollout strategies, autoscaling, and observability
Able to weigh tradeoffs between container orchestration (k8s vs. ECS) and serverless container platforms when advising clients
Comfortable with event-driven serverless (Lambda, Cloud Functions) and able to judge when it is the right tool versus a long-running container
Infrastructure as Code
Configuration Management
CI/CD Pipelines
Monitoring and Observability
Creating high-quality infrastructure that meets the needs of its users and the business
Applying security best practices in deployment pipelines and cloud environments
Helping clients meet Service Level Agreements and Service Level Objectives by providing observable infrastructure
Implementing high-availability and disaster recovery architecture
Identifying technology, communication, and process issues and proposing improvements
Sharing best practices for cloud architectures that are fault-tolerant, highly available, and cost-effective for the client’s business
Mentoring by sharing experience and knowledge with client developers and operations teams so they are well-positioned to succeed, even long after we're gone
Collaborating internally with other Test Double agents on infrastructure best practices
Learning new frameworks, languages, tech, and techniques to adapt to changing client needs
Communicating openly and honestly with everyone, even if the news will not be positively received
Requirements
8+ years of experience in software development
3+ years of experience in DevOps, cloud computing, or operations
3+ years of experience in consulting
Strong understanding of Configuration Management tools like Ansible, Chef, or Puppet
Strong understanding of Infrastructure as Code tools like Terraform
Strong understanding of CI/CD pipeline tools like Jenkins, CircleCI, GitHub Actions, and GitLab CI/CD
Demonstrated ability to direct AI in delivery: defining problems, applying quality checks, and producing consistent results, with examples of improving team workflows
Strong understanding of containerized deployment platforms and tools like Kubernetes, AWS Elastic Container Service, and Docker
Strong understanding of observability and monitoring tools like CloudWatch, Grafana, and Datadog
Low ego, high emotional intelligence (EQ), and a mindset of continuous improvement
Experience leading teams in decomposing work and maintaining a healthy backlog that is valuable to the business
Experience balancing competing priorities and influencing teams towards high-quality software development practices
Ability to communicate effectively across different levels or positions within an organization
Proficiency in designing, architecting, and refactoring systems of moderate complexity worked on by teams of 10+
Ability to resolve conflicts and issues within the delivery team
Experience in mentoring and leading the technical direction of software engineers
Expertise in designing and delivering systems to production using one or more of the following: Ruby, Go, Python, JavaScript/TypeScript
Tech Stack
Ansible
AWS
Chef
Cloud
Docker
Grafana
JavaScript
Jenkins
Kubernetes
Puppet
Python
Ruby
Terraform
TypeScript
Go
Benefits
Remote First
Work from anywhere; travel is required for critical client and company functions
Time off: 5 weeks of flexible time off (vacation and sick time) plus 10 paid holidays; 2-week sabbatical after 5 years
Company Ownership: ESOP (employee stock ownership plan)
Test Double is 100% employee owned
Family Support: 8 weeks of paid parental leave at 100% of salary, plus additional unpaid leave
Retirement: Company contribution of 3% of salary to a 401(k)
Continuing Education: 1 week of conference attendance (and up to $3,000 of expenses)