Crossover is seeking a DevOps Engineer to maintain uptime for over 50 SaaS products. The role involves managing infrastructure initiatives, incident response, and implementing monitoring and automation solutions to ensure operational excellence.
Responsibilities:
- Advance reliability and standardization of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices
- Diagnosing production incidents, deploying immediate remediations, and authoring root cause analyses with permanent fixes routed to accountable teams
- Authoring, reviewing, and deploying production changes, including assessing whether a proposed change is safe for execution
Requirements:
- Deep AWS infrastructure expertise (this is our primary platform - other cloud experience alone won't cut it)
- Experience owning large production infrastructure and troubleshooting production outages independently (not just following a runbook)
- Experience scripting with Python and Bash for day-to-day administration operations
- Experience managing and migrating production databases with multiple engines (including MySql, Postgres, Oracle, MS-SQL)
- Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)
- Linux systems administration expertise