Cisco is a leading technology company that focuses on networking, security, and observability solutions. They are seeking a Technical Leader for their Platform Engineering team to ensure the reliability and performance of their segmentation and firewall platform, collaborating with various teams and managing the lifecycle of production services.
Responsibilities:
- Be part of an Agile team moving at a very fast pace to ensure reliability, availability and performance of our next-generation segmentation and firewall platform
- The team owns the full lifecycle of production services — from infrastructure architecture through deployment, observability, and operations
- As a Technical Leader, you will influence reliability architecture and drive operational excellence, collaborating with product management and field teams to ensure platform reliability
- Own the lifecycle of production services — deployment, monitoring, incident response, and capacity planning in a Linux-based multi-VM distributed systems environment
- Perform systems-level debugging and performance tuning in virtual machines, containers, and Kubernetes environments
- Build automation, tooling, and reliability frameworks in Python or Go, including database access layers and secrets storage, primarily using MongoDB, Vault, and Consul
- Operate and troubleshoot in Linux environments, including containers, virtual machines, and Kubernetes-orchestrated environments
- Manage and scale cloud infrastructure in AWS and GCP using Infrastructure as Code (IaC) tools such as Terraform
- Own and evolve CI/CD pipelines to enable rapid, safe deployment of changes in SaaS environments
- Conduct reviews, create runbooks, and lead post-incident learning across team members and the wider group, representing our platform in all forums
- Stay current with emerging technologies and industry trends in networking, security, cloud-native infrastructure, and related areas, while mentoring others
Requirements:
- Bachelors + 8 years of related experience, or Masters + 6 years of related experience, or PhD + 3 years of related experience
- 6+ years in software development in Python or Go
- Experience with AWS, GCP, or other major cloud service providers, including deploying, managing, or troubleshooting cloud infrastructure and services in production environments
- Work experience across diverse infrastructure environments, including on-premise, cloud platforms, or hybrid environments
- Experience with Linux systems including networking
- Experience with bare metal servers or virtual machines (VMs)
- Experience with containerized environments or Kubernetes-orchestrated systems
- Troubleshooting experience, to include cloud, on-premise or hybrid environments
- Experience designing and building Infrastructure as Code (IaC) with Terraform
- Experience in Kubernetes networking
- Strong background in testing high-performance computing, multi-threading, and low-latency systems
- Scripting, Ansible playbooks, and Jenkins experience
- Excellent verbal and written communication skills and professional presentation