Albert Bow is seeking a Senior Site Reliability Engineer (Azure) to enhance their Azure infrastructure for a distributed ledger platform aimed at enterprise applications. The role involves designing secure and scalable infrastructure while ensuring high reliability and operational excellence in customer environments.
Responsibilities:
- Design and build secure, scalable Azure infrastructure for production distributed systems
- Develop and manage Terraform-based infrastructure as code
- Translate business and product requirements into technical architecture solutions
- Build and improve platform services, APIs, and infrastructure integrations
- Partner with engineering, product, and security teams on enterprise-ready deployments
- Improve observability, reliability, monitoring, and incident response processes
- Support customer deployments and production infrastructure operations
Requirements:
- Strong experience designing and operating production systems in Azure
- Expertise with Terraform and infrastructure automation
- Experience with Go and/or Python
- Experience building greenfield infrastructure environments
- Strong understanding of distributed systems, platform engineering, or high-availability architectures
- Experience with CI/CD and infrastructure lifecycle automation
- Excellent communication and collaboration skills
- Kubernetes and container orchestration experience
- Prometheus, Grafana, or similar observability tooling
- Experience with Argo, Spacelift, or related orchestration platforms