Design, implement, and manage scalable AWS cloud infrastructure, with expertise in Azure and GCP considered a plus.
Implement and maintain configuration management solutions using Chef, with knowledge of Ansible and Terraform as a plus.
Design, implement, and manage CI/CD pipelines, ensuring alignment with best practices.
Support production environments by monitoring system performance, availability, and reliability.
Collaborate with teams to streamline deployment processes.
Participate in troubleshooting and resolving infrastructure and deployment issues.
Lead initiatives in cloud migration, cost optimization, and service performance improvement.
Set up and maintain monitoring and observability tools such as Prometheus, Grafana, and Datadog, including alerts and dashboards.
Participate in incident, change, problem, and release management, supporting production environments with on-call duties, including late nights and weekends as required.
Learn and apply security best practices across automation and cloud systems.
Stay updated with industry trends, share knowledge with the team, and mentor junior engineers.
Requirements
2+ years of hands-on DevOps experience, with practical expertise in Infrastructure as Code (IaC) tools such as AWS CloudFormation and Terraform.
B.Sc in Computer Science/Engineering or equivalent.
Practical experience with provisioning, configuration, and automation of systems using tools such as Packer, Chef, and Ansible.
Proficient in scripting or programming languages such as Python and Bash for automation and operational tasks.
Solid understanding of CI/CD concepts; experience with Jenkins, GitLab CI, or similar tools is preferred.
Familiarity with security and compliance standards, including SOC 2 and PCI DSS.
Strong proficiency in cloud technologies, with a focus on AWS; experience with Azure and GCP is a plus.
Proficient with version control systems (e.g., Git).
Working knowledge of containerization and orchestration (e.g., Docker, Kubernetes).
Experience with monitoring and observability tools such as Prometheus, Grafana, and Datadog.
Familiarity with database management systems in cloud environments (e.g., MySQL, PostgreSQL, SQL Server, AWS RDS, Azure CosmosDB).
Excellent communication and collaboration skills, with the ability to work effectively in a distributed or remote team.
Strong problem-solving skills and the ability to work independently.