Gluware is the intelligent network automation industry leader, powering self-operating enterprise networking for Global 2000 companies across various sectors. As a Senior DevOps Engineer, you will own the infrastructure, CI/CD pipelines, and multi-cloud operations to ensure the platform's reliability and scalability.
Responsibilities:
- Own design, provisioning, and day-to-day operations of multi-cloud infrastructure (AWS primary, Azure secondary) using IaC tools — Terraform and/or CloudFormation
- Manage VPC architecture, multi-region deployments, peering, and disaster recovery automation
- Maintain and evolve container orchestration (Kubernetes / EKS) and container image pipelines
- Drive cloud cost optimization, capacity planning, and right-sizing initiatives
- Own and continuously improve the end-to-end CI/CD pipeline (Jenkins or equivalent)
- Own and optimize release automation across dev, integration, staging, and production environments
- Champion GitOps practices and automated deployment strategies (blue/green, canary)
- Manage artifact repositories (Artifactory) and dependency scanning
- Establish and maintain SLOs/SLAs; lead incident response, blameless post-mortems, and reliability improvements
- Build and manage observability stack: metrics (Prometheus/Grafana or Datadog), logging (ELK/OpenSearch), and distributed tracing
- Evolve on-call processes and runbooks to reduce MTTR
- Assess and address vulnerabilities and integrate security tooling into pipelines (SAST, DAST, container scanning, secrets management via Vault or AWS Secrets Manager)
- Maintain SOC 2 compliance posture; participate in audits and evidence collection
- Manage access controls (LDAP and RBAC) across build tooling
- Maintain source code escrow
- Maintain and improve internal developer tooling, local dev environments, and self-service infrastructure capabilities
- Collaborate with Engineering to define and enforce platform standards and best practices
- Document architecture decisions and runbooks
Requirements:
- 5+ years of DevOps, SRE, or Platform Engineering experience in a product company
- Extensive experience with configuration management (Ansible or Chef preferred)
- Deep hands-on AWS experience (EC2, EKS/ECS, RDS, S3, Lambda, CloudFormation, VPC, Route 53, IAM); Azure a plus
- Strong IaC skills with Terraform (preferred) or Pulumi, CloudFormation, and packer
- Solid CI/CD pipeline experience —Jenkins, GitHub Actions, GitLab or equivalent; trunk-based development and deployment automation
- Proficiency in at least one scripting/automation language: Typescript, Python, Bash, Ruby, or Go
- Proficient with build management using nx + pnpm, maven, gradle
- Hands-on container platform experience: Docker, Kubernetes (EKS or self-managed)
- Strong Linux system administration and shell scripting skills
- Solid networking fundamentals (DNS, load balancing, VPNs, VPC peering, security groups, firewalls)
- Experience operating and improving observability stacks in production environments
- Bachelor's degree in computer science, Information Technology, or equivalent hands-on experience
- GitOps tooling familiarity (ArgoCD, Flux)
- Experience with bazel
- Experience with service mesh (Istio, Linkerd) or API gateway patterns
- Knowledge of networking vendor ecosystems (Cisco, Juniper, Arista) — Gluware orchestrates these environments
- Hands-on experience with AI/ML infrastructure or LLM-assisted DevOps workflows
- AWS Certified DevOps Engineer – Professional, CKA/CKAD, or equivalent certification
- Familiarity with Java, JavaScript, and Node.js (the languages our product is built on)
- Experience in a compliance-regulated SaaS environment (SOC 2, ISO 27001, or FedRAMP)