ICONMA is a company in the Internet Content & Information sector, seeking a Systems Engineer III for a remote position. The role involves applying computer science principles to design, develop, and evaluate software and systems, while also managing server lifecycle and automation processes.
Responsibilities:
- The main function of a systems engineer is to apply the principles of computer science and mathematical analysis to the design, development, testing, and evaluation of the software and systems that make computers work
- A typical systems engineer analyzes user needs, and then designs, tests, and develops software to meet those needs
- Develop back-end services, workflows, and automation that handle fleet management and the full server lifecycle (network boot, firmware updates, OS provisioning, failure detection, erasure, and decommissioning)
- Develop out-of-band server management tooling in a multi-vendor environment, including state management and telemetry collection
- Write and review code, test changes on hardware, automate testing
- Troubleshoot provisioning, firmware update and network boot issues end to end
- Collaborate cross-functionally and work autonomously to solve complex problems
Requirements:
- 3+ years of experience in the following areas:
- Experience in TCP/IP network fundamentals
- Experience with Linux systems and server systems management
- Experience in troubleshooting the server network boot process from both the network and the server point of view
- Experience in coding and scripting automation (Python, Go, Rust, Bash, Ruby)
- Experience in automating management of large fleets of server hardware (Redfish, IPMI)
- Proven experience troubleshooting and analyzing server/systems architecture and components
- Experience with Kubernetes, Docker and/or cloud deployment technologies
- Bachelor's degree in computer science, software engineering or relevant field required. Experience can be used in lieu of education
- Deep understanding of networking concepts and troubleshooting network boot processes
- Experience managing Linux servers, including provisioning, firmware updates, and lifecycle management
- Proficiency in automation using languages such as Python, Go, Rust, Bash, or Ruby
- Server management protocols: familiarity with Redfish and IPMI for automating large server fleets
- Cloud and container technologies: experience with Kubernetes, Docker, and cloud deployment
- Troubleshooting analytics: proven ability to analyze server/systems architecture and components
- Linux and TCP/IP certifications are preferred but not strictly required