Galvanick is a startup focused on protecting the industrial world against cyber attacks with their threat detection platform. The Infrastructure Engineer will be responsible for building internal tooling and pipelines, managing Kubernetes clusters, and maintaining the observability stack to ensure system reliability and performance.
Responsibilities:
- Design, deploy, and maintain Kubernetes clusters across cloud and on-premise environments, ensuring high availability and scalability
- Build and maintain our observability platform using Prometheus, Grafana, and Alertmanager to provide comprehensive metrics collection, visualization, and alerting
- Define and track SLIs/SLOs, create actionable dashboards, and implement alerting strategies to proactively detect issues such as sensor failures or data gaps
- Embrace automation and leverage infrastructure management tools (e.g., Terraform, Ansible, CDK) to improve deployment pipelines and eliminate recurring issues
- Develop and maintain internal tooling to enhance the efficiency and productivity of our development processes
- Design and implement CI/CD pipelines that streamline our workflows
- Conduct regular maintenance and troubleshooting activities, ensuring the reliability of our systems and equipment