Litmos is an established leader in eLearning solutions, developing platforms used by over 30 million people globally. They are seeking a Lead Cloud Engineer to be the technical authority responsible for the architecture and reliability of their cloud platform, driving modernization and operational excellence while mentoring engineers.
Responsibilities:
- Define and evolve the architecture of the cloud platform on Microsoft Azure
- Own the technical roadmap for the cloud platform and guide modernization initiatives
- Serve as the senior technical escalation point for complex infrastructure and production issues
- Build and maintain infrastructure using Infrastructure-as-Code (Bicep preferred, ARM, or Terraform)
- Define reusable platform patterns and infrastructure modules that enable product teams to deploy safely and efficiently
- Improve CI/CD pipelines, deployment automation, and operational consistency across environments
- Establish reliability practices including observability, monitoring, alerting, SLOs, and incident response
- Design highly available systems capable of supporting global SaaS workloads
- Lead cloud cost optimization initiatives and provide FinOps leadership across the organization
- Implement tooling and reporting that provide transparency into infrastructure usage and cost drivers
- Partner with engineering, product, and security teams to ensure infrastructure meets scalability, security, and compliance requirements
- Mentor engineers and help develop strong cloud engineering capabilities across the organization
- Leverage AI-assisted engineering tools to accelerate infrastructure automation, operational insights, and system optimization
Requirements:
- 7+ years of experience in cloud engineering, platform engineering, or DevOps roles with increasing technical responsibility
- Experience operating production infrastructure for SaaS platforms
- Strong expertise with Microsoft Azure infrastructure and architecture
- Experience working with Infrastructure-as-Code tools such as Bicep, ARM templates, or Terraform
- Strong scripting or programming skills in PowerShell, Python, C#, or similar languages
- Experience designing and operating highly available distributed systems
- Ability to lead technically while remaining hands-on with infrastructure and automation
- Strong collaboration and communication skills
- Azure certifications such as AZ-104 (Azure Administrator) or AZ-305 (Azure Solutions Architect)
- Experience with Azure Kubernetes Service (AKS) and containerized workloads
- Experience working across multiple public cloud providers (Azure and AWS)
- Experience planning or executing cloud migration or platform modernization initiatives
- Familiarity with SRE practices and modern observability platforms
- Experience leveraging AI-assisted engineering tools and emerging AI-driven operational platforms