Lirio is a technology/software company that provides expertise in behavioral science, data science, and machine learning. The Senior Cloud Engineer will be responsible for defining and supporting cloud infrastructure architecture and engineering practices, as well as providing technical support to various engineering teams.
Responsibilities:
- Support the design and planning of cloud-agnostic solutions capable of being deployed to multiple cloud providers and customer public cloud and private data centers
- Design, implement, test, deploy, maintain, and support cloud-native and cloud-agnostic self-healing infrastructure across environments, multiple cloud providers, and customer data centers as a top-level contributor
- Align cloud infrastructure with HITRUST & SOC2 compliance and support audits
- Plan, implement, and support a developer platform to improve self-service
- Collaborate with development teams to evaluate and identify optimal cloud services and infrastructure solutions and address issues as they arise
- Review existing systems and offer recommendations for improvement
- Identify, analyze, mitigate, and resolve infrastructure issues, vulnerabilities, and application deployment issues through monitoring, scanning, observability, and processes
- Support and improve Lirio’s engineering practices, with an emphasis on quality and security
- Document decisions, work product, and cloud practices
- Review code, designs, and contributions from others, promoting stability, security, compliance, scalability, readability, and maintainability
- Write clean and maintainable code/infrastructure-as-code (IaC)
- Assist in project planning, estimation, and resource allocation
- Help manage infrastructure spend efficiency
- Implement and support build & CI/CD pipeline engineering efforts as needed
- Pursue continuous learning through individual study, online courses, product documentation, and community resources to bring innovation to the technical organization
- Design, implement, and support stateful database infrastructure including managed cloud databases and Kubernetes-native database operators
Requirements:
- 5+ years in software product development, cloud engineering, and operations across public or private clouds (AWS & Azure; OCI a plus) including hands-on work with secure, scalable enterprise systems (Java or Python)
- Bachelor's Degree in Computer Science or related field required
- Demonstrated ability to help others, collaborate cross-functionally, thrive in a fast-paced environment, and stay current on emerging cloud technologies and industry trends
- Experience supporting infrastructure for a health tech or healthcare-related company with a software product
- Strong proficiency with containerization (Docker, Kubernetes), the Linux operating system, cluster management/administration, and deployment tooling (Helm, ArgoCD) for distributed, event-driven architectures including stateful workloads and database operators
- Strong proficiency in DevSecOps principles and automation frameworks (Terraform, Ansible, Azure Key Vault), scripting (Python, Bash), and CI/CD pipelines (Azure DevOps – Gradle, Poetry, JFrog)
- Experience designing and securing network infrastructure end-to-end (data in transit and at rest) and meeting compliance frameworks (HITRUST, PCI) with SOC2 attestation
- Skilled at using observability platforms (Datadog or similar) to review operational metrics and drive improvements
- Demonstrable command of a programming language; Java, Python, and Go are preferred, though demonstrable skills with TypeScript, C#, or other languages will be considered
- Software engineering background or a previous software engineering role, which may include platform engineering or reliability engineering
- Proficient with Microsoft Entra ID/AD
- Ability to quickly learn company terminology and processes
- Self-starter with strong time management and work planning skills
- Experience deploying and supporting the same software on multiple clouds at the same time
- Experience with microservices and eventually consistent architectures
- Experience with event-driven architectures, asynchronous messaging, and Apache Kafka
- Experience running highly available, cloud-agnostic, production data and event infrastructures in Kubernetes
- SRE experience with large-scale cloud-based systems
- Experience building infrastructure for Data and Machine Learning teams
- Experience deploying, operating, and migrating relational database infrastructure, including Amazon RDS and CloudNativePG (CNPG) on Kubernetes, with high availability configuration, backup/restore, and monitoring for stateful workloads
- Master's Degree in Business Analytics, Business Administration, or similar preferred