PRI Global is seeking an experienced Systems Engineer to design, develop, test, and maintain infrastructure systems for large-scale server environments. The role covers backend services, automation, and server lifecycle management, as well as troubleshooting network boot and provisioning issues.
Responsibilities:
- Develop backend services, workflows, and automation for server fleet management
- Manage full server lifecycle including network boot, firmware updates, OS provisioning, failure detection, and decommissioning
- Build and maintain out-of-band server management tooling across multi-vendor environments
- Write and review code, and automate hardware testing processes
- Troubleshoot server provisioning, firmware updates, and network boot issues end-to-end
- Implement telemetry collection and state management for server infrastructure
Requirements:
- Strong knowledge of TCP/IP networking fundamentals
- Experience with Linux systems administration and server management
- Experience troubleshooting server network boot processes
- Automation scripting experience using Python, Go, Rust, Bash, or Ruby
- Experience managing large server fleets using Redfish/IPMI
- Strong troubleshooting skills related to server architecture and hardware components
- Experience with container and cloud technologies such as Docker and Kubernetes
- Bachelor's degree in Computer Science, Software Engineering, or a related field (equivalent work experience may be considered)
- Certifications related to Linux or TCP/IP networking are preferred