CAKE.com is a unicorn product-based company with a global presence in North America, Europe, and Asia. They are looking for an experienced SRE to scale and secure their rapidly growing infrastructure, automating critical processes and ensuring a seamless experience for users.
Responsibilities:
- Make sure the infrastructure is scalable and can handle high traffic volume (100,000 requests per second)
- Define and deploy monitoring, alerting, and logging systems
- Make sure the system is secure and there are no vulnerabilities
- Respond to and resolve production incidents, conducting thorough post-mortems
- Monitor server logs, watching out for abnormalities
- Design, manage and maintain tools to automate operational processes
Requirements:
- 5+ years of relevant work experience
- Working experience with AWS, Docker, Git, CI/CD tools like Gitlab CI, Jenkins, etc
- Experience with IaC tools like Terraform, CloudFormation, Ansible, Puppet, Packer
- Proficiency with Linux and other Unix based systems (including writing shell scripts)
- Experience setting up build automation and repositories
- Excellent understanding of security and safety best practices
- Bachelor's degree in Computer Science or equivalent work experience
- Excellent written and verbal English communication skills
- Ability to work with mixed US and EU based teams
- Working experience with Hashicorp stack (Nomad, Consul, Vault)
- Working experience with MongoDB, MySQL, PostgreSQL and distributed architecture
- Experience with agile development methodologies (behavior-driven development, continuous integration, and delivery, executable documentation)
- Proficiency with OOP and Java