Develop and implement a comprehensive AI Ops network automation strategy aligned with organizational goals and objectives.
Identify opportunities for automation in network provisioning, configuration management, monitoring and analysis.
Provision and manage on-prem AI infrastructure and containerized environments (Docker, Kubernetes) optimized for GPU workloads and distributed compute.
Deployment and orchestration of agentic AI systems (LangChain, LangGraph etc).
Implement security best practices including secrets management, model access control, data encryption, and audit logging for AI pipelines.
Monitor and optimize performance of AI workloads.
Develop and maintain scripts, workflows, and code to automate network tasks using programming languages such as Python, Ansible and other automation tools.
Collaborate with software development teams to integrate network automation into the CI/CD pipeline.
Implement network orchestration solutions to streamline the provisioning and management of network services.
Develop network inventory reporting and control applications.
Develop telemetry and Netflow based analytics solutions for a carrier grade network.
Integrate network automation with monitoring tools for real-time visibility and reporting.
Requirements
4+ years of full software development life cycle experience in a Network technology environment.
Preferably with experience in AI/ML environments.
Strong proficiency in scripting and coding languages such as Python and YAML.
Experience with network automation tools and frameworks (e.g., Ansible, Puppet, Chef).
Good understanding of GIT workflows and integration into CI/CD pipelines (Github, Gitlab, Jenkins Server and GPU infrastructure management experience
Solid understanding of container lifecycle management.
Foundational knowledge of network protocols, routing, switching, platforms and network inventory management.
Network analytics experience including Netflow and processing of high-volume telemetry data.
Experience with Netbox customization a plus.
Must be result-oriented and constantly looking for new ways to improve efficiency and productivity.
Must be able to exercise independent judgment under minimum supervision.
Ability to assess and analyze technical situations and provide viable alternatives.