Design and document end-to-end cloud architectures across AWS, Azure, and Alibaba Cloud.
Lead the development of highly available, scalable, and secure multi-region cloud environments, including disaster recovery strategies with well-defined RTO and RPO targets.
Establish and enforce cloud governance frameworks covering resource tagging, identity and access control, and cost optimization.
Own the definition and maintenance of CI/CD infrastructure, including support for blue/green deployments, canary releases, and infrastructure-as-code (IaC) best practices.
Collaborate closely with DevOps, Site Reliability Engineering (SRE), Application, and Security teams to align architectural decisions with business needs.
Define and oversee observability standards, ensuring robust monitoring, alerting, and telemetry pipelines for production workloads.
Continually evaluate and recommend the adoption of cloud-native technologies, tooling, and industry best practices to improve system resilience and maintainability.
Serve as an escalation point for critical production issues related to cloud infrastructure.
Requirements
8+ years of proven experience in cloud architecture, with strong hands-on expertise across AWS, Azure, and Alibaba Cloud.
Deep understanding of high availability, scalability, resiliency, and security in cloud-native environments.
Proficiency with Infrastructure as Code tools such as Terraform, CloudFormation, or Pulumi.
Strong experience in setting up and managing CI/CD pipelines (e.g., Jenkins, GitHub Actions, GitLab CI).
Expertise in disaster recovery planning, multi-region failover, and system hardening.
Solid background in networking, IAM, containerization (Kubernetes), and service mesh technologies.
Experience collaborating with geographically distributed teams and managing stakeholders across multiple time zones.
Strong analytical, documentation, and communication skills.