The Judge Group is seeking a DevOps / Platform Engineer to join their fully remote engineering team. In this role, you will focus on building a seamless developer experience and maintaining a resilient, scalable cloud environment while automating processes and ensuring system reliability.
Responsibilities:
- Kubernetes Orchestration
  - Manage, scale, and troubleshoot Azure Kubernetes Service (AKS) clusters
  - Ensure containerized workloads are healthy and efficiently bin-packed across nodes
- Infrastructure as Code (IaC)
  - Treat infrastructure as software
  - Use Terraform to provision and manage the Azure environment
- Deployment Excellence
  - Design and maintain CI/CD pipelines using GitHub Actions
  - Implement GitOps-based CD using ArgoCD
- Observability & Reliability
  - Configure Datadog for deep system insights
  - Build dashboards and proactive monitors
  - Track DORA metrics and SLOs to measure engineering health
- Data Streaming
  - Manage Kafka operations (via Strimzi or equivalent)
  - Ensure event-driven systems remain performant and reliable
- Cloud Governance
  - Oversee Azure services, including:
    - Key Vault (secret management)
    - Container Registry (ACR)
    - Storage Accounts
    - Service Bus
Requirements:
- Deep understanding of the Azure ecosystem, including networking and security
- Strong knowledge of AKS internals, networking, ingress, and resource optimization
- Proficiency with Terraform
- Comfort with Git-centric workflows and automation-first practices
- Experience creating actionable Datadog alerts and defining SLOs
- Ability to work closely with developers to explain platform behavior and system interactions