Manage system uptime in cloud-native (AWS, GCP) and hybrid architectures;
Create infrastructure-as-code (IaC) standards that meet security and engineering requirements, using one or more of Terraform, cloud CLI scripts, and cloud SDK programming;
Build CI/CD pipelines to create, test, and deploy application and cloud architecture patterns, using Jenkins and cloud-native toolchains;
Develop automated tooling that fulfills service requests and deploys changes to production;
Create detailed runbooks for managing services and for detecting, remediating, and recovering from incidents;
Troubleshoot and triage issues across complex distributed architecture service maps;
Respond to high-severity application incidents and improve runbooks to reduce MTTR.
Requirements
Bachelor's degree in a technology-related field;
Experience in software engineering, systems administration, database administration, and networking;
Experience developing and/or administering software on public cloud platforms;
Experience monitoring infrastructure, application uptime, and availability to meet functional and performance objectives;
Experience with languages such as Python, Bash, Java, Go, JavaScript and/or Node.js;
Demonstrable cross-functional knowledge of systems, storage, networking, security, and databases;
Systems administration skills, including automation and orchestration on Linux/Windows using Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.);
Cloud certification is a plus.
Tech Stack
Ansible
AWS
Chef
Docker
Google Cloud Platform
Java
JavaScript
Jenkins
Kubernetes
Linux
Node.js
Python
Terraform
Go
Benefits
Work arrangement: Remote (home office)
Contract type: Contractor (PJ)
Team hours: Business hours
Important: We value diversity and inclusion and welcome applicants regardless of color, race, religion, gender, gender identity, nationality, disability, sexual orientation, ancestry, age, or any other characteristic.