CyberArk, a Palo Alto Networks company, is the global leader in identity security, trusted by organizations around the world to secure human and machine identities in the modern enterprise. They are seeking a Senior Site Reliability Engineer to manage AWS infrastructure, automate cloud-based deployments, and ensure architecture meets availability and recoverability requirements.
Responsibilities:
- Management of AWS infrastructure components such as VPCs, EC2, EKS, S3, tagging schemes, CloudFormation, etc.
- Deployment and management automation of cloud-based infrastructure and software
- Working with configuration management tools on both Windows and Linux: Terraform, Ansible, CloudFormation
- Ensuring cloud-based architecture meets availability and recoverability requirements
- Architecture and implementation of cloud-based monitoring, alerting and reporting: Datadog, CloudWatch, ELK, Grafana
- Developing tools that enable teams to achieve greater output and reliability
Requirements:
- B.S. in Computer Science or equivalent experience
- Minimum of 2 years of experience managing AWS infrastructure
- Minimum of 5 years of experience with systems engineering and software development
- Solid understanding of and experience with containerization technologies such as Docker
- Working knowledge of open-source tools such as Terraform, Grafana, Logstash, Elasticsearch, Ansible
- Solid understanding of and experience with web services, databases and related infrastructure and architectures
- Solid understanding of backup/restore best practices
- Strong programming expertise in C#, C++, Java, Python or an equivalent language
- Excellent troubleshooting skills
- Experience supporting an enterprise-level SaaS environment
- Security experience a plus