Collins Aerospace is a strategic unit within the aviation industry, focused on innovative and dynamic product portfolios. It is seeking a Principal Site Reliability Engineer to manage AWS-hosted infrastructure, ensuring service availability and performance while driving the migration of applications to the cloud.
Responsibilities:
- YOU will own one or more AWS-hosted infrastructures delivering products to our commercial customers in a B2B model
- YOU will manage, enhance, and troubleshoot our AWS environments, ensuring that as we develop new features we follow best-of-breed industry patterns for continuous integration and deployment of new product and infrastructure releases
- YOU will deliver using an agile pattern with our customers, so you should be used to working on features for continuous test and release
- YOU will help and influence other members of the team in using the right AWS tools to scale the features we are delivering for our customers
- YOU will help drive the migration of a significant multi-tenant Connected Aviation web application from on-premises into the AWS cloud
- YOU will be part of the on-call roster supporting critical service incidents
Requirements:
- Typically requires a degree in Science, Technology, Engineering or Mathematics (STEM) and minimum 8 years prior relevant experience or an Advanced Degree in a related field and minimum 5 years of experience
- U.S. Person - Job requires access to ITAR or 600/500-series EAR information or hardware (directly or indirectly) and the company will not seek an export authorization for this role
- Flexibility to be available for team meetings as early as 9:00 AM Eastern to accommodate team members in the UK and India
- Software engineering background or a good level of coding skills in an OO language (.NET or Python being advantageous) for managing infrastructure as code using AWS CDK
- Experience managing AWS ECS clusters and working with SQL-based databases such as PostgreSQL and with RabbitMQ message brokers
- Experience with Bash and/or PowerShell scripting of new parts of infrastructure (or suggesting other ways to bring infrastructure to life)
- Experience with system design consulting, platform management, and capacity planning for AWS, as well as CloudFormation experience
- Experience with Docker containerization & Git
- A dual background in both systems infrastructure and software engineering
- Experience with reporting and monitoring using industry standard tools and techniques
- Experience with Azure DevOps and Pulumi
- Experience with Scrum (and Scaled Agile Framework as a bonus)
- A desire to automate as much as possible and proficiency with automation tools
- Understanding of network segmentation, routing, and VPNs
- Security knowledge: can work with our cyber teams and understand their recommendations
- Communication skills: can thoroughly document what you have built and articulate it clearly and concisely to team members and stakeholders