Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. The Senior Infrastructure Engineer/SRE will be responsible for designing, building, and advancing the core infrastructure to enable the engineering team to execute efficiently and securely.
Responsibilities:
- Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure
- Ensure reliability of multi-cloud Kubernetes clusters and pipelines
- Metrics, logging, analytics, and alerting for performance and security across all endpoints and applications
- Infrastructure-as-code deployment tooling and supporting services on multiple cloud providers
- Automate operations and engineering
- Focus on automation so we can spend energy where it matters
- Building machine learning infrastructure that enables AI teams to train, test, and deploy on large-scale datasets
Requirements:
- 5+ years experience in DevOps, Site Reliability Engineering, Production Engineering, or equivalent field
- Deep proficiency with coding languages such as Golang or Python
- Deep familiarity with container-related security best practices
- Production experience working with Kubernetes, and a deep understanding of the Kubernetes ecosystem, including popular open-source tooling such as cert-manager or external-dns
- Production experience with Kubernetes templating tools such as Helm or Kustomize
- Production experience with IAC tools such as Terraform or CloudFormation
- Production experience working with AWS and services such as IAM, S3, EC2, and EKS
- Production experience with database software such as PostgreSQL
- Experience with GitOps tooling such as Flux or Argo
- Experience with CI/CD such as GitHub Actions
- Experience with GPU-enabled clusters is a bonus
- Production experience with other cloud providers such as Google Cloud and Azure is a bonus