Independence Pet Holdings is a corporate holding company managing a diverse portfolio of modern pet health brands and services. It is seeking a DevOps Engineer to design and operate its Azure application platform, ensuring high availability and resilience while implementing CI/CD pipelines and Infrastructure-as-Code strategies.
Responsibilities:
- You will design and operate the application layer of our Azure platform, including:
  - Multi-region deployment patterns across Azure regions for high availability and resilience
  - Azure Kubernetes Service (AKS) for core application workloads
  - Azure Functions for event-driven and serverless components
  - Azure Service Bus for asynchronous messaging and decoupled architectures
  - Azure API Management (APIM) for secure and scalable API exposure
  - Azure Front Door for global routing, performance optimization, and failover
- You will ensure these services are deployed in a consistent, production-ready manner with clear standards that teams can follow
- You will build and evolve our application delivery model using Azure DevOps (ADO) and GitOps principles
- Design advanced CI/CD pipelines with automated testing, security gates, and approval workflows
- Implement zero-downtime deployment strategies (Blue/Green, Canary, rolling)
- Standardize branching and release models to support parallel development and fast, safe releases
- Create reusable pipeline templates and deployment patterns that teams can adopt easily
- Drive consistent, repeatable deployment processes across services and environments
- Your goal is to make deployments boring, predictable, and safe
- Define and maintain Infrastructure-as-Code using Terraform for Azure resources
- Partner with the Infrastructure team to ensure networking, identity, and foundational services meet application platform needs
- Own application configuration strategy using Azure App Configuration and Key Vault
- Eliminate configuration drift across Dev, Test, Staging, and Production
- Ensure environment parity to reduce release risk
- Design for regional failover and high availability at the application layer
- Implement and tune monitoring using Azure Monitor and Application Insights
- Establish actionable alerting that reduces noise and improves signal
- Lead root cause analysis (RCA) for platform and deployment-related incidents
- Continuously improve resilience, performance, and recovery processes
- Create self-service templates, documentation, and golden paths for teams
- Coach teams on deployment patterns, observability, and cloud-native practices
- Act as a bridge between Software Engineering and Infrastructure teams
- Help teams design systems that are deployable, scalable, and cost-efficient
- Build cost awareness into deployment patterns and architectural standards
- Optimize AKS, serverless, and messaging usage for performance per dollar
- Partner with leadership to monitor and manage cloud spend
- Prevent waste through right-sizing, lifecycle policies, and automation
Requirements:
- Experience designing and operating an application platform on Azure
- Knowledge of multi-region deployment patterns across Azure regions for high availability and resilience
- Experience with Azure Kubernetes Service (AKS) for core application workloads
- Experience with Azure Functions for event-driven and serverless components
- Knowledge of Azure Service Bus for asynchronous messaging and decoupled architectures
- Experience with Azure API Management (APIM) for secure and scalable API exposure
- Experience with Azure Front Door for global routing, performance optimization, and failover
- Experience with Azure DevOps (ADO) and GitOps principles
- Ability to design advanced CI/CD pipelines with automated testing, security gates, and approval workflows
- Experience implementing zero-downtime deployment strategies (Blue/Green, Canary, rolling)
- Ability to standardize branching and release models to support parallel development and fast, safe releases
- Experience creating reusable pipeline templates and deployment patterns
- Ability to drive consistent, repeatable deployment processes across services and environments
- Experience defining and maintaining Infrastructure-as-Code using Terraform for Azure resources
- Ability to partner with the Infrastructure team to ensure networking, identity, and foundational services meet application platform needs
- Experience owning application configuration strategy using Azure App Configuration and Key Vault
- Ability to eliminate configuration drift across Dev, Test, Staging, and Production
- Experience designing for regional failover and high availability at the application layer
- Experience implementing and tuning monitoring using Azure Monitor and Application Insights
- Ability to establish actionable alerting that reduces noise and improves signal
- Experience leading root cause analysis (RCA) for platform and deployment-related incidents
- Ability to continuously improve resilience, performance, and recovery processes
- Experience creating self-service templates, documentation, and golden paths for teams
- Ability to coach teams on deployment patterns, observability, and cloud-native practices
- Experience acting as a bridge between Software Engineering and Infrastructure teams
- Ability to help teams design systems that are deployable, scalable, and cost-efficient
- Experience building cost awareness into deployment patterns and architectural standards
- Ability to optimize AKS, serverless, and messaging usage for performance per dollar
- Experience partnering with leadership to monitor and manage cloud spend
- Ability to prevent waste through right-sizing, lifecycle policies, and automation