Lead a team responsible for managing the complex challenges of scale unique to CrowdStrike, leveraging your expertise in software engineering, systems design, and automation.
Play a critical role in ensuring that our services maintain the highest levels of reliability, uptime, and performance, meeting the needs of our customers while continuously improving our systems.
Work on meaningful projects while providing support and mentorship to your team, enabling them to learn, grow, and make a lasting impact in the cybersecurity landscape.
Requirements
10+ years of software engineering experience with significant focus on reliability engineering, platform infrastructure, and production operations at scale.
3+ years of hands-on management experience overseeing SRE/Platform engineering teams, including incident command and reliability ownership.
Deep understanding of SRE principles, including SLOs, SLAs, SLIs, and error budgeting strategies, applied to large-scale distributed systems.
Experience driving system reliability by blending software engineering principles with AI-driven automation, moving from reactive firefighting to proactive, automated operations.
Proficiency in at least one cloud environment (AWS, Azure, or GCP) with emphasis on multi-region architecture, cloud-native reliability patterns, and security-first cloud design.
Proven experience owning reliability for high-throughput distributed systems processing millions of events per second, including capacity planning, traffic management, and load shedding strategies.
Strong incident management background including leading major incident response, facilitating blameless postmortems, and driving systemic reliability improvements.
Demonstrated ability to build, operationalize, and maintain highly scalable, security-critical systems with zero tolerance for data loss or downtime.
Bachelor's degree in Computer Science or related field, or equivalent work experience.
Ability to work 2+ days per week in our Sunnyvale offices.
Tech Stack
AWS
Azure
Cloud
Cyber Security
Distributed Systems
Google Cloud Platform
Benefits
Market leader in compensation and equity awards
Comprehensive physical and mental wellness programs
Competitive vacation and holidays to recharge
Paid parental and adoption leave
Professional development opportunities for all employees regardless of level or role
Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections