Applicantz is seeking a highly skilled Senior Network DevOps Engineer to drive the transformation of traditional network operations into a fully automated, cloud-driven, self-healing infrastructure platform. The role involves designing, automating, monitoring, and operating large-scale enterprise networks while collaborating with multiple teams to build resilient and secure network services.
Responsibilities:
- Design and develop network automation frameworks using Python, Ansible, Terraform, and APIs
- Build Infrastructure-as-Code (IaC) solutions for enterprise networking and security platforms
- Automate provisioning, configuration, validation, compliance checks, and remediation workflows
- Develop reusable automation modules, SDKs, and API integrations
- Implement GitOps workflows for network and security infrastructure changes
- Create self-service network automation capabilities for internal engineering teams
- Design and operate cloud networking solutions across AWS, Azure, and GCP
- Automate management of:
  - VPC/VNET architectures
  - Transit Gateways
  - Cloud Firewalls
  - Load Balancers
  - DNS Services
  - VPN Connectivity
  - Direct Connect / ExpressRoute
- Implement multi-cloud networking standards and governance
- Automate firewall policy lifecycle management
- Integrate security controls into CI/CD pipelines
- Develop compliance validation frameworks
- Automate risk analysis and security posture assessments
- Collaborate with cybersecurity teams on Zero Trust initiatives
- Build enterprise observability solutions using:
  - Grafana
  - Prometheus
  - InfluxDB
  - ELK Stack
  - Splunk
  - Dynatrace
  - OpenTelemetry
- Create dashboards, alerts, health checks, and performance analytics
- Develop network telemetry pipelines and operational intelligence platforms
- Design AI-assisted operational workflows
- Develop integrations with:
  - LLMs
  - RAG platforms
  - AI Agents
  - ChatOps platforms
- Build intelligent troubleshooting and incident response automation
- Implement predictive analytics and anomaly detection solutions
- Build and maintain CI/CD pipelines
- Implement automated testing frameworks
- Integrate security scanning into deployment workflows
- Establish engineering standards and best practices
- Drive platform reliability initiatives across infrastructure services
Requirements:
- Agentic AI development
- DevOps (Python, Ansible, CI/CD)
- AWS AgentCore
- Cloud networking