Smile Digital Health is dedicated to improving global health through innovative health data solutions. The Cloud Engineer will design, build, and maintain cloud-hosted services and platforms to support the company's SaaS offerings, ensuring they are scalable, reliable, and secure.
Responsibilities:
- Develop, test, and maintain cloud-native RESTful APIs using Python, GoLang/ Go and JavaScript/TypeScript
- Build and manage Docker container images, ensuring best practices for image build, security, cost efficiency and deployments
- Author, maintain, and version HELM charts for deploying services into Kubernetes/ Openshift Environment, including configuration/ annotation values, templates, upgrades and rollback support
- Participate in (or build) CI/CD pipelines to automate deployment, testing, monitoring, and roll-back of SaaS services
- Monitor the performance, reliability, and scaling behavior of services; identify bottlenecks and implement solutions (caching, concurrency, resource limits, horizontal auto-scaling)
- Develop and maintain serverless SaaS components using AWS CDK, Lambda, API Gateway, and ECS/Fargate
- Ensure high availability and fault tolerance through Kubernetes features such as readiness and liveness probes, sensible resource requests/limits, and proper namespace/networking isolation
- Implement and enhance CI/CD pipelines with automated testing, versioning, and documentation publishing
- Manage in-cluster dependencies such as PostgreSQL, Kafka, Grafana, and ElasticSearch using Helm or Operators
- Collaborate with security, operations, and infrastructure teams to ensure deployments meet security, compliance, logging, monitoring, and auditing requirements
- Troubleshoot and debug issues in production and staging environments. Provide root cause analysis and propose corrective measures
- Maintain clear documentation of architecture, deployment procedures, configuration, and usage of cloud tools
Requirements:
- 3-5+ years of professional experience in cloud development or infrastructure engineering roles
- Bachelor's degree in Computer Science, Software Engineering, or a related discipline (or equivalent work experience)
- Proven experience working to design and build SaaS applications and multi-cloud solutions
- Understanding of DevOps and hands-on experience working with DevOps tooling including, but not limited to, Gitlab CI, GitHub Actions, Azure Devops etc
- Working knowledge of public cloud SDKs/APIs (Azure SDK, AWS SDK, GCP SDK, etc.) for interacting with provider services (storage, identity, secrets, etc.). Boto3 and AWS CDK preferred
- Strong experience building APIs (REST or GraphQL) in Python and/or JavaScript/ TypeScript
- Solid knowledge of Docker, containerization best practices, and experience deploying containers in Kubernetes / OpenShift
- Experience authoring and maintaining Helm charts and Operators for complex deployments
- Familiarity with monitoring/logging/tracing tools (e.g., Prometheus, Grafana, OpenTelemetry, Elasticsearch/Loki, etc.)
- Strong understanding of cloud hosting (AWS and Azure preferred) and cloud storage, secrets, and security services
- Familiarity with GitOps tooling and principles
- Working experience with Hashicorp Terraform to build and deploy complex environments
- Ability to work in cross-functional teams and communicate effectively, both written and verbal
- Self-motivated, detail-oriented, with strong problem solving and troubleshooting skills