Oracle is a leading company in AI and cloud solutions. They are seeking a Senior Network Development Engineer to support the design, deployment, and operations of their global Oracle Cloud Infrastructure, focusing on RDMA cluster networking for AI and HPC workloads.
Responsibilities:
- Collaborate with program/project managers to develop milestones and deliverables
- Primarily use existing procedures and tools to develop and safely execute network changes, developing new procedures when needed
- Develop solutions to enable front line support teams to act on network failure conditions
- Troubleshoot complex cabling connectivity issues in data centers, with the ability to coordinate on-site efforts between L1 Teams and TPMs
- Travel to data center locations; 25% travel required
- Mentor junior engineers
- Participate in the network solution and architecture design process and contribute to roadmap development
- Participate in operational rotations as either primary or secondary
- Provide break-fix support for events. Serve as the escalation point for event remediation. Lead post-event root cause analysis
- Frequently develop scripts to automate routine tasks for the team and business units
- Coordinate with networking automation services for the development and integration of support tooling
- Coordinate with network monitoring to gather telemetry and create alert rules based on it
- Build dashboards that represent data at various network layers and device roles to help identify network issues and anomalies
- Serve as SME on software development projects for network automation and network monitoring
- Collaborate with network vendor technical account teams and the internal Quality Assurance team to drive bug resolution and assist in the qualification of new firmware and/or operating systems
Requirements:
- Bachelor's degree in CS or a related engineering field with 5+ years of Network Engineering experience, or Master's degree with 3+ years of Network Engineering experience
- Experience working in a large ISP or cloud provider environment. Experience in RDMA Networking is a plus
- Experience working in a network operations role
- Strong knowledge of protocols such as MPLS, BGP/OSPF/IS-IS, TCP, IPv4, IPv6, DNS, and DHCP. Knowledge of VXLAN and EVPN is an added advantage
- Extensive experience with scripting or automation and data center design – Python preferred, but must demonstrate expertise in a scripting or compiled language
- Experience with networking protocols such as TCP/IP, VPN, DNS, DHCP, and SSL
- Experience with network monitoring and telemetry solutions
- Experience with network modeling and programming – YANG, OpenConfig, NETCONF
- Ability to use professional concepts and company objectives to resolve complex issues in creative and effective ways. Capable of working under limited supervision
- Excellent organizational, verbal, and written communication skills
- Excellent judgment in influencing product roadmap direction, features, and priorities
- Willingness to participate in an on-call rotation