Lead GFiber’s peering, caching and transit infrastructure design and IP planning. Optimizing traffic engineering, IX/Transit/CDN integrations, capacity planning to ensure optimal latency and cost-efficiency across the edge.
Partner with software teams and Network Reliability Engineering team on the design and development of the GFiber automation stack, advancing low & zero touch operations and rapid configuration deployments across the core and edge
Define and evolve standards for network observability and network health, integrating telemetry, fault management, and incident data to drive actionable insights and implement auto-remediation strategies.
Serve as the Tier-3 escalation for complex routing, convergence, or large-scale DDoS issues, contribute to root cause analysis and propose remediation workflows to prevent recurrence.
Collaborate with Product and Software teams to enable advanced service offerings (e.g., L2/L3 VPNs, DIA) and influence vendor roadmaps to align with GFiber’s business goals.
Requirements
Bachelor’s degree in Computer Science, Electrical Engineering, a related field, or equivalent practical experience.
7 years in service provider network design and operations, focusing on high-availability core environments.
Knowledge of IP/MPLS protocols
BGP (v4/v6), IS-IS/OSPF, Segment Routing (SR-MPLS/SRv6), RSVP-TE, and EVPN, network troubleshooting and packet-level analysis tools (NetFlow, SNMP, Wireshark, TCPdump) and direct experience with multi-vendor platforms (e.g., Juniper, Nokia, Arista, or Cisco).
Experience with automation and scripting for device configuration and validation (Python, Go, Ansible, or Netconf/YANG).
Experience in incident management, telemetry design, and maintaining high-availability systems in a large network production environment.
It's preferred if you have:
Experience with building end to end automation workflows and event driven automation concepts.
Familiarity with implementing Site Reliability Engineering (SRE) principles within a network context, including SLO/SLA definition and error budget management.
Experience with modern telemetry stacks such as Prometheus, Grafana, or custom alerting systems.
Deep understanding of BGP attributes and traffic engineering across an ISP core network.
JNCIE-SP, CCIE Service Provider, or equivalent deep industry experience.