Deepgram is the leading platform in the Voice AI economy, providing real-time APIs for speech-to-text and text-to-speech. The Network Engineer will optimize and manage the global network infrastructure to ensure ultra-low-latency experiences for real-time voice AI applications.
Responsibilities:
- Architect and optimize Deepgram's global network for real-time audio streaming, minimizing latency across geographic regions
- Design and manage CDN and edge computing strategies to bring processing closer to end users
- Configure and maintain BGP routing, including peering relationships and route optimization for multi-cloud and bare metal environments
- Implement and tune load balancing solutions that intelligently distribute traffic across heterogeneous infrastructure (cloud, bare metal, GPU clusters)
- Build and optimize multi-cloud networking architectures spanning AWS, other cloud providers, and on-premises data centers
- Design and maintain bare metal networking including top-of-rack switching, spine-leaf architectures, and high-bandwidth GPU interconnects
- Develop latency measurement, monitoring, and optimization tooling to continuously improve network performance
- Plan and execute network capacity expansions as Deepgram's traffic and infrastructure footprint grow
- Collaborate with infrastructure and platform teams to ensure network designs support both real-time inference and large-scale training workloads
- Establish network security best practices including segmentation, DDoS mitigation, and access control policies
Requirements:
- 5+ years of experience in network engineering, including design, implementation, and operations
- Deep expertise with BGP, OSPF, and other dynamic routing protocols in production environments
- Hands-on experience designing and operating load balancing solutions at scale
- Strong understanding of CDN architectures and content delivery optimization
- Experience with cloud networking, specifically AWS VPC design, transit gateways, peering, and hybrid connectivity
- Practical experience with bare metal networking -- switch configuration, VLAN design, physical infrastructure
- Solid understanding of TCP/IP, DNS, TLS, and HTTP/2 / HTTP/3 performance optimization
- Experience with network monitoring, troubleshooting, and performance analysis tools
- Experience with real-time streaming protocols (WebSocket, WebRTC, RTP/RTSP) and optimizing networks for media traffic
- Knowledge of GPU cluster networking including InfiniBand, RoCE (RDMA over Converged Ethernet), and high-bandwidth interconnects
- Experience with multi-cloud networking spanning AWS, GCP, and/or Azure
- Familiarity with network automation and Infrastructure as Code (Ansible, Terraform, or similar)
- Experience with anycast routing and global traffic management
- Background in network performance optimization for latency-sensitive applications
- Understanding of DDoS mitigation strategies and network security at scale