Crunchyroll is a company dedicated to delivering the art and culture of anime to a global community. They are seeking a Senior DevOps Engineer to manage and scale systems for their streaming platform, collaborating with delivery teams to ensure efficient and reliable services while driving projects forward and improving operational practices.
Responsibilities:
- Manage and scale the systems and services that power our streaming platform
- Work with delivery teams and lead projects, driving them forward to completion
- Ensure that platform processes and best practices are followed
- Develop tools and systems that support services and facilitate high-quality software delivery
- Ensure that delivery teams' systems are efficient and reliable
- Guide teams toward better quality and practices, including observability, service troubleshooting, and incident management
- Take the initiative to improve overall processes and introduce new technologies and processes when needed
- Share knowledge across teams
Requirements:
- At least a Bachelor's degree in software engineering, computer science, information technology, information systems, or similar
- 8+ years of experience working in production-level environments as a DevOps/SRE
- Proficient in AWS cloud technologies and Infrastructure as Code (IaC) tools such as Pulumi and Terraform
- 3+ years of experience managing containerized infrastructure in ECS or Kubernetes
- Experience in GitOps practices and CI/CD
- Experience managing Linux-based infrastructure
- Experience with observability and service instrumentation to build effective dashboards and alerts
- Familiarity with relational databases such as PostgreSQL and MySQL, and non-relational databases such as MongoDB and DynamoDB
- Experience with CDN and caching technologies including AWS CloudFront
- Experience working with and guiding Software Engineers in production best practices, sharing knowledge across teams and teammates
- Experience in at least two scripting languages, including TypeScript
- Able to accommodate periodic off-hour meetings with international team members
- Excellent communication skills and ability to collaborate with stakeholders
- Participate in on-call and developer support rotations
- Knowledge of streaming video delivery, video encoding workflows, video origin architecture, and issues of scale for video delivery
- Proficiency in TypeScript
- Familiarity with the big data and payments domains
- Understand security concepts such as cryptography, authentication, authorization, and security protocols
- Familiarity with Datadog
- Familiarity with GitHub Actions