Design, implement, and maintain the on-premise Kubernetes platform to ensure high availability, scalability, and security for critical services.
Establish and enforce best practices for Kubernetes cluster lifecycle management, including upgrades, patching, and configuration management using tools like Helm, Kustomize, or GitOps tooling (ArgoCD/Flux).
Develop and manage core infrastructure components such as service mesh, ingress controllers, logging, and secret management within the on-premise environment.
Troubleshoot and resolve complex issues related to the Kubernetes control plane, networking, storage, and underlying hardware/OS in a mission-critical, on-premise setting.
Collaborate closely with software engineering teams to understand their needs and implement improvements, and maintain security policies, network segmentation, and access controls within the Kubernetes environment.
Requirements
Strong knowledge of build systems and other tools in the software development cycle, such as Git, linting, GitHub, code review, testing frameworks, and deployment frameworks.
Strong knowledge of CI/CD best practices and principles. Direct experience with Buildkite is a plus.
Experience developing tools and services in Python or Go.
Experience with Linux operating systems in both day-to-day development and operational environments.
Experience with IaC technologies such as Ansible, Chef, Puppet, or Terraform.
Knowledge of AWS or other similar cloud platforms.
Knowledge of on-call support processes, incident management and monitoring tooling.
Experience with server hardware, including but not limited to rack-mounted servers, expansion cards, storage drives and power supplies.
Experience with out-of-band management solutions, including but not limited to KVM, Dell iDRAC, and/or HP iLO.
This position requires in-person work with on-premise servers in our Richmond, CA facility 1-2 days/week. Candidates must be willing to be hands-on. This includes, but is not limited to, racking servers, configuring operating systems, and assembling servers from individual components.