Teradata is a leading company in cloud analytics and data platforms for AI. They are seeking a Staff Software Engineer to lead the design, development, and evolution of networking software for their massively parallel processing platform, influencing technical direction and mentoring engineers.
Responsibilities:
- Architect, design, and evolve scalable, reliable, and fault-tolerant networking software for high-speed, low-latency interconnects, delivering predictable performance across large-scale MPP systems
- Evaluate and drive adoption of emerging technologies across operating systems, high-performance networking, adapters, DPUs, accelerators, and interconnect fabrics
- Lead complex debugging and root-cause analysis of system-level customer and field issues, including SLES OS crash dump analysis, spanning hardware, firmware, OS, and networking layers
- Define and execute targeted research initiatives and proof-of-concepts to validate new technologies, quantify performance, and guide platform decisions
- Partner with product, hardware, and systems engineering teams to scope, prototype, benchmark, and productionize platform enhancements
- Establish performance benchmarks, validation methodologies, and success metrics for networking and interconnect innovations
- Influence platform roadmaps through deep understanding of industry trends, academic research, and partner technologies
- Mentor and technically guide other engineers through design reviews, code reviews, and architectural discussions
- Leverage AI-assisted coding, analysis, and testing tools to accelerate development cycles and improve code quality and reliability
Requirements:
- Strong background in HPC or large-scale distributed systems development
- Proven experience with Linux kernel and driver development in C, including production support
- Deep familiarity with bare-metal and virtualized environments, including performance tradeoffs
- Expertise in InfiniBand and Ethernet networking, leveraging RDMA and RoCE for low-latency, high-throughput communication
- Solid understanding of TCP/IP and UDP networking, along with Linux networking, tuning, and diagnostic tools
- Packet-level analysis and Linux kernel debugging using tools such as tcpdump, kgdb, and crash
- Experience designing and optimizing high-throughput, low-latency data transport protocols
- Strong knowledge of the Linux kernel, including DKMS, driver lifecycle management, and compatibility across kernel versions
- Proficiency in C, Bash, and Python for systems programming, automation, and diagnostics
- Experience with massively parallel processing (MPP) using message-passing interfaces
- Effective use of modern AI-assisted development tools to accelerate design, coding, and debugging
- Bachelor's degree in Computer Science (distributed systems focus preferred), Computer Engineering, or Electrical Engineering, or equivalent practical experience
- 7+ years of experience in high-performance Linux systems or networking software development, with demonstrated technical leadership
- Experience with DPUs, SmartNICs, or hardware offload technologies
- Hands-on work with kernel-bypass networking (e.g., RDMA verbs, DPDK, XDP, eBPF)
- Experience with high-speed Ethernet (100G/200G/400G/800G) and modern interconnect fabrics
- Experience tuning systems for NUMA, CPU affinity, cache locality, and memory bandwidth
- Exposure to distributed storage or database platforms in production environments
- Experience working with hardware vendors (NICs, switches, accelerators) on performance or integration issues
- Contributions to open-source networking, kernel, or systems software projects