Cornelis Networks delivers high-performance scale-out networking solutions for AI and HPC datacenters. The company is seeking a Software Engineer for its Fabric Management team to develop and enhance fabric management tools and infrastructure, with a focus on software for fabric topology management and performance monitoring.
Responsibilities:
- Design, implement, and maintain fabric management software components including topology management, routing protocols, and control plane services
- Develop and enhance fast fabric tools, diagnostic utilities, and operational tooling for fabric deployment and management
- Contribute to fabric simulation infrastructure for validation, testing, and performance analysis
- Build reliable, scalable distributed systems for managing large-scale fabric deployments
- Profile and optimize fabric management software for latency, scalability, and resource efficiency
- Write clean, maintainable code with comprehensive unit and integration tests; participate in code reviews
- Contribute to customer issue resolution and technical support when needed
- Work closely with hardware, firmware, and platform teams to integrate fabric management with switch platforms
- Maintain and improve existing codebase, addressing technical debt and enhancing reliability
- Use AI coding assistants and emerging tooling to speed up development and improve code quality
Requirements:
- BS in Computer Science, Computer Engineering, or related field
- 5-7 years of software engineering experience
- Strong proficiency in C/C++ and Python
- Linux systems programming experience
- Network protocol implementation or distributed systems experience
- Strong debugging and problem-solving skills for complex systems
- Excellent communication and collaboration skills
- Commitment to AI-first development practices and continuous learning of emerging AI tooling
Preferred Qualifications:
- Kubernetes development experience (CNI, device plugins, operators), strongly preferred
- Go/Golang proficiency
- Experience with fabric routing protocols, topology management, or distributed control planes
- InfiniBand, Omni-Path, or high-speed interconnect protocol knowledge
- HPC fabric management or cluster management experience
- Network simulation or modeling experience
- Performance optimization and profiling experience
- Open-source contribution history
- Technical leadership or mentoring experience
- Experience with AI-assisted development tools (GitHub Copilot, Cursor, or similar)
- Familiarity with test automation frameworks and CI/CD systems