Cloud, Distributed Systems, Docker, Google Cloud Platform, Grafana, Kubernetes, Terraform, Machine Learning, GCP, Google Cloud, Helm, Datadog, New Relic, AppDynamics, GitHub, CI/CD, Remote Work
About this role
Role Overview
Create and support CI/CD tooling for IaC and services in GitHub.
Create monitoring and alerting systems to track the performance and health of cloud services, and work closely with application teams to develop a common logging framework.
Identify opportunities for optimization in cloud services and implement strategies to improve efficiency and cost-effectiveness.
Develop automated processes for deploying and managing cloud services using Terraform, Helm charts, and additional automation tooling.
Work closely with security teams to implement security best practices.
Work as part of a cross-functional guild to develop and maintain operational standards and best practices. Reduce operational risk in our business.
Stay up-to-date with the latest cloud technologies.
Reduce friction in delivering software within Google Cloud.
Requirements
Minimum of 2 years of relevant experience.
Strong understanding of cloud infrastructure, networking, containerization, and cloud computing concepts, ideally on GCP.
Experience with monitoring tools such as Google Monitoring Suite (preferred), AppDynamics, Datadog, New Relic, or Grafana.
A strong interest in data science/machine learning concepts.
Comfortable working in a dynamic, fast-paced environment.
Experience building large-scale platforms and solving problems of scalability, reliability, observability, validation, and cost efficiency.
Experience building or operating high-performance distributed systems.
Hands-on experience with complex system design, data pipelines and architectures, and scale and performance tuning, with good knowledge of Docker and Kubernetes.