INGOT is focused on building high-performing teams and is seeking a Senior DevOps Engineer to execute and refine DevOps strategies. The role involves implementing automation and integration across IT Operations, ensuring alignment with the organization's DevOps methodology.
Responsibilities:
- Design, implement, and maintain scalable CI/CD pipelines using GitHub Actions, following GitOps methodology with ArgoCD to enable reliable and automated Kubernetes deployments
- Automate infrastructure provisioning and configuration using Terraform, CloudFormation, and Ansible
- Manage and optimize AWS cloud infrastructure, focusing on automation, security best practices, and cost efficiency
- Deploy and operate Kubernetes (EKS) workloads using Helm and Kustomize, ensuring consistent configuration across environments
- Implement and maintain monitoring, logging, and alerting systems using Prometheus, Grafana, Zabbix, OpenSearch, and Opsgenie
- Support and maintain database and data platform infrastructure (e.g., PostgreSQL, MongoDB, Redshift, ClickHouse) ensuring reliability, monitoring, and backup strategies
- Troubleshoot production issues and participate in incident response and post-mortem analysis to improve system reliability and resilience
- Collaborate with development and data teams to improve release processes and deployment workflows
- Maintain technical documentation and automate operational processes such as backup and disaster recovery
- And any other duties related to the role
Requirements:
- 3+ years of experience in DevOps, Cloud Engineering, or similar roles
- Strong experience with CI/CD pipeline design and automation using tools such as GitHub Actions, ArgoCD, and GitOps workflows
- Hands-on experience with AWS cloud services and infrastructure automation
- Proficiency with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Ansible
- Experience managing and operating Kubernetes clusters (preferably EKS) and deploying applications using Helm or Kustomize
- Strong understanding of containerization using Docker and experience working with Kubernetes
- Experience implementing monitoring, logging, and alerting solutions using tools such as Prometheus, Grafana, Zabbix, OpenSearch, or similar platforms
- Solid understanding of Linux systems, networking concepts, and cloud security best practices
- Experience with incident management, troubleshooting production systems, and root cause analysis
- Strong collaboration and communication skills, with experience working closely with development and platform teams
- Experience with databases and data platforms such as Redshift, PostgreSQL, MySQL, MongoDB, ClickHouse, and Kafka is a plus