IEX is an exchange operator and technology company dedicated to innovating for performance in capital markets. They are seeking a Senior Systems Reliability Engineer to optimize and automate processes and systems for improved reliability, scalability, and maintainability.
Responsibilities:
- Responsible for the technical operations of our trading platform
- Participate in the engineering process as we design, build, and manage our systems
- Build tools to monitor and automate processes
- Be a core contributor in our change management and learning review processes
- Troubleshoot issues across the whole stack - hardware, software, application, and network
- Document current and future configuration processes and policies
- Translate customer needs and projected product utilization to operational reliability targets
- Guides other functions (e.g., development, market operations, business development, subscribers, etc.) on reliability techniques, application, and system functionality
- Educate and mentor team/company on operational best practices
Requirements:
- Automation experience with Ansible or a similar configuration management tools
- Hands-on experience with Linux, python, bash, and git
- Experience supporting large, complex, and distributed systems
- Host side networking
- Packet level understanding of network traffic, working experience troubleshooting with packet captures, etc
- TCP/IP Stack, routes
- Multicast
- General familiarity with Data Center workflows and working with DC personnel to implement changes
- Hardware familiarity
- Arista and Cisco switches and their CLI's
- Corvil
- Solarflare, Mellanox