AnsibleGrafanaKubernetesLinuxMicroservicesPrometheusPythonUnixGitLab CIArgoCDRancherRESTfulGitLabCI/CDRemote Work
About this role
Role Overview
API Implementation: Design, build, and maintain robust, scalable APIs from scratch using Python. Ensure all tooling and internal services strictly adhere to 12-factor application principles.
Kubernetes-Native CI/CD Refactor: Lead the effort to analyze and refactor our existing deployment pipelines and applications to be fully Kubernetes-native, migrating legacy workflows to highly scalable containerized environments utilizing GitLab CI and ArgoCD.
Observability: Implement comprehensive monitoring and alerting solutions leveraging Prometheus, Grafana, and Mimir. Ensure high visibility into system health by defining and tracking critical service metrics.
Infrastructure as Code (IaC) & Automation: Drive infrastructure automation by writing and maintaining modular Ansible playbooks. Manage centralized automation execution, RBAC, and workflows using AWX.
Engineering Rigor: Maintain a high standard of quality by producing rigorous technical documentation and thoroughly testing all code and infrastructure changes prior to deployment.
Operational Excellence: Partner with cross-functional network, storage, and compute teams to ensure services are secure, observable, compliant, and highly resilient.
Requirements
Experience: 5-8 years of professional experience in Platform Engineering, DevOps, or Software Engineering focusing on infrastructure.
Programming Languages: Strong proficiency in Python; highly seasoned in building RESTful APIs from the ground up.
Architecture: Understanding of 12-factor application methodologies and modern microservices architecture and how to implement.
Containerization: Hands-on experience managing application-level Kubernetes workloads, including Deployments, Services, ConfigMaps, Ingresses, and HPA (experience with Rancher is highly preferred).
Pipelines & Tooling: Deep understanding of complex CI/CD pipelines. Proven experience managing and integrating tools like GitLab, ArgoCD, and Harbor.
Automation Configuration: Solid hands-on experience with Ansible and AWX to manage and execute complex automation workflows.
Systems Thinking: Linux/Unix foundation with a basic understanding of networking, storage, and security in an enterprise environment.