Leidos is an industry and technology leader serving government and commercial customers with smarter, more efficient digital and mission innovations. The Site Reliability Engineer will focus on the reliability, performance, and scalability of complex distributed systems, developing automated testing frameworks and maintaining system performance. Responsibilities include proactive incident management, leading software deployments, and enhancing service quality for end-user services.
Responsibilities:
- Utilize metrics and tools like Aternity to monitor end-user performance and proactively identify potential issues
- Develop strategies to address recurring incidents and improve system reliability
- Collaborate with engineering and operations teams to implement automated solutions for incident prevention
- Lead the planning, coordination, and execution of software deployments across end-user devices
- Ensure deployments are completed on time, with minimal disruption to end users
- Work with stakeholders to prioritize deployment schedules and align with organizational goals
- Analyze service performance metrics to identify areas for improvement
- Develop and implement initiatives to enhance the quality of end-user services
- Advocate for automation, proactive monitoring, and best practices to improve service delivery
- Define and maintain a product vision and roadmap for End User/Seats Services, aligned with organizational objectives
- Translate business and operational requirements into actionable features and technical requirements
- Manage the product backlog, prioritize user stories, and ensure alignment with strategic goals
- Serve as the primary point of contact between the End User/Seats Services team and business stakeholders
- Create user stories and acceptance criteria that clearly communicate stakeholder needs to the development team
- Participate in team demos, retrospectives, and continuous improvement initiatives
- Ensure clear documentation of product requirements, progress, and updates for stakeholders
- Publish strategies, implementation guides, and maintenance documentation for End User/Seats Services