Iceberg is a high-growth SaaS DevSecOps platform seeking a Site Reliability Engineer / DevOps Engineer with strong AWS experience. The role involves supporting the build, scaling, and operation of their cloud infrastructure while contributing to improving system reliability and automation.
Responsibilities:
- Supporting the design and maintenance of AWS infrastructure using Terraform
- Assisting with building and maintaining CI/CD pipelines
- Monitoring system performance and helping improve reliability and availability
- Troubleshooting incidents and contributing to root cause analysis
- Working with engineering teams to support reliable and secure deployments
- Automating manual processes where possible
- Participating in an on-call rotation
Requirements:
- 3+ years in SRE, DevOps, or cloud engineering roles
- Hands-on experience working with AWS (infrastructure, not just usage)
- Exposure to Terraform or other Infrastructure as Code tools
- Experience with Kubernetes (ideally in AWS environments)
- Familiarity with CI/CD pipelines (GitLab, Jenkins, GitHub Actions, etc.)
- Scripting experience (Bash and/or Python)
- Understanding of monitoring/logging tools (e.g. Datadog, ELK, Prometheus, Grafana)
- Awareness of security best practices in cloud environments
- US Citizen (non-negotiable)