Cornelis Networks delivers high-performance scale-out networking solutions for AI and HPC datacenters. They are seeking a Senior System Validation Engineer to drive system- and component-level validation, including test design, automation, and CI/CD pipeline maintenance.
Responsibilities:
- Architect and drive the implementation of validation infrastructure including shared libraries, frameworks, and reusable modules, written in Python
- Clearly document infrastructure and test development plans and align them to key feature delivery dates across multiple products and validation cycles
- Design and execute test cases at all stages of the validation lifecycle, from manual investigative testing to complete automated regression test suites
- Design, implement, update, and debug automated test cases
- Configure and maintain continuous integration systems, monitor results, and triage failures
- Troubleshoot and debug issues at the system, network, and software component levels
- Champion AI-assisted development tool adoption into team workflows, including evaluating new tools, establishing best practices for use, and driving process improvements that increase development velocity and code quality
- Identify gaps, suggest solutions, and continually enhance software validation procedures, processes, and practices
Requirements:
- S or M.S. Degree in Computer Science, Engineering, Technology, or related discipline and at least 3-5 years of work experience
- Candidate must demonstrate excellent communication, organization, and time management skills
- Demonstrated experience with object-oriented programming in Python
- Experience with multiple of the following: Software validation at the system and/or integration level, Debugging and troubleshooting system, network, and software issues, Collaborating across teams to ensure quality test development and issue resolution
- Solid understanding of agile practices such as sprint planning, backlog grooming, and retrospectives, with demonstrated experience leading scrum ceremonies and providing technical guidance to team members
- Experience with networking technologies, especially Omni-Path, InfiniBand, Ethernet, or other HPC or RDMA interconnects
- Working knowledge of version control software (e.g., git)
- Solid understanding of CI/CD concepts and experience with pipeline automation tools such as Jenkins
- Experience leveraging AI-assisted development tools to accelerate coding workflows, including code generation, debugging, and test reviews
- Prior Linux system administration experience
- Utilizing Ansible for configuration management and deployment
- Knowledge of storage filesystems, protocols, and benchmarks
- Ability to work independently and proactively in a highly collaborative, multi-disciplinary engineering environment
- Knowledge of UEFI capabilities and utilizing PXE boot for installation and configuration of various Linux distributions
- Experience with Ethernet networking protocols and standards (IEEE 802.3) and familiarity with the Ultra Ethernet Consortium specification is a plus