Own and evolve the core Fail Operational metric framework for quantifying, severity-ranking, and systematically driving down autonomous-mission stoppage and degradation events, including category definitions, classification methodology, and triage criteria
Lead Fail Operational metric target-setting for new and existing milestones, authoring supporting documentation and consolidating inputs across metric categories
Drive the evolution of the Fail Operational metric architecture, ensuring alignment with current priorities and operational needs
Develop process for reviewing observed events, mapping to seen failure modes or confirming as new
Partner cross-functionally with Software, Hardware, and Operations teams to escalate issues, identify mitigations, and support delivery of features that improve performance metrics
Communicate complex metric concepts, data analyses, and actionable insights to diverse stakeholders and executives

B.S. or higher degree in Systems Engineering, Computer Science, Electrical Engineering, Applied Mathematics, or a related field
5+ years of relevant professional experience in systems engineering, data analysis, performance tracking, risk quantification, or fault management for complex and safety-critical systems
Demonstrated experience defining, implementing, and managing quantitative metrics and data-driven frameworks, with proficiency in probability, statistics, and Python for large-scale data analysis
Strong understanding of fault detection, categorization, and severity assessment methodologies, with experience developing technical frameworks, taxonomies, or classification systems
Excellent technical communication and documentation abilities, with a collaborative approach to influencing, driving consensus, and leading cross-functional initiatives

Senior/Staff Systems Engineer, Fail Operational

Key skills