Glassbox is a leading force in shaping digital experiences, helping organizations uncover digital issues and improve the customer journey. They are looking for a DevOps Engineer to work with cutting-edge technologies, building scalable, globally distributed infrastructure and maintaining high-scale production environments.
Responsibilities:
- Work with a diverse set of technologies, simplifying complex solutions
- Collaborate closely with multiple teams across the organization
- Be part of a team responsible for designing, optimizing, and maintaining a complex, high-scale production environment that handles massive traffic loads
- Architect, deploy, and maintain robust and scalable cloud infrastructures on AWS and Azure
- Develop and optimize CI/CD pipelines to support automated deployment, testing, and scaling across multiple environments
- Implement and manage monitoring, logging, and alerting solutions across cloud platforms to ensure application health and performance
- Provide advanced troubleshooting and resolution for infrastructure issues in production, development, and testing environments
Requirements:
- 4+ years of experience in a DevOps or related engineering role, with a strong background in AWS and cloud-native environments
- Proven ability to design, manage, and maintain high-scale production systems, ensuring reliability, performance, and scalability
- Deep expertise in cloud technologies and application security, with a strong focus on best practices for securing resilient, scalable, and cost-efficient architectures
- Extensive experience with containerization and orchestration technologies, including Docker, Kubernetes, and Helm
- Proficiency in automation tools such as Terraform, and hands-on experience with CI/CD pipelines using tools like Jenkins
- Strong knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, Loki) to ensure system health, optimize performance, and proactively detect issues
- Proficiency in scripting (e.g., Node.js, Bash) to automate workflows and enhance operational efficiency
- Excellent communication, collaboration, and documentation skills, with a proactive approach to problem-solving
- Passion for learning new technologies and tackling complex challenges in a fast-paced environment
- Hands-on experience managing big data infrastructure, optimizing performance and scalability for data-intensive applications
- Proficiency in database technologies, including Cassandra, Elasticsearch, ClickHouse, and PostgreSQL HA
- Expertise in Kafka cluster administration, including cross-region replication and high-availability configurations
- Experience with MLOps workflows and infrastructure, including model deployment, monitoring, and scaling in production environments