Akamai Technologies is a leader in cloud security solutions, serving billions of users globally. The Senior Lead Site Reliability Engineer will ensure optimal performance and uptime of critical security products by analyzing system performance and developing monitoring tools.
Responsibilities:
- Deploying and maintaining the platform and tools used internally
- Developing automation pipelines to support development, testing, and deployment workflows
- Collaborating with our support, operations and engineering teams to investigate and troubleshoot complex problems
- Improving our system monitoring and analysis platform to speed error detection and remediation, enhancing performance and reliability
- Working with Development and Quality Assurance teams to build more robust solutions and improve code quality and stability
- Participating in on-call rotations, guiding restoration and repair of service-impacting issues
Requirements:
- Have 4 years of experience and a Bachelor's degree in Computer Science or a related field
- Have professional experience in a DevOps, SRE, or SysAdmin role, working with large scale distributed systems
- Have experience with a cloud platform (we use Azure heavily) and automation tools such as Jenkins and Terraform
- Have experience developing software using Python and Golang, and familiarity with scripting languages
- Have exposure to container technologies such as Docker and Kubernetes
- Demonstrate communication and presentation skills
- Have a Secret Security Clearance