Sprout Social is looking to hire a Site Reliability Engineer for its Engineering team. This role involves designing and building reliable, scalable systems that support Sprout’s global customer base, driving infrastructure initiatives, and improving security posture through automation and collaboration.
Responsibilities:
- Design and build reliable, scalable, and performant systems that support Sprout’s global base of 30,000+ customers across 100+ countries
- Drive infrastructure initiatives that enable product teams to deliver value quickly and safely through shared, production-ready tools and platforms (“Paved Roads”)
- Improve Sprout’s security posture through automation, auditability, and clear processes to build sustainable and secure solutions
- Collaborate cross-functionally with product, site reliability engineering, data platform, and GRC teams to deliver scalable, secure-by-default infrastructure
- Investigate and learn from system failures and incidents to improve overall system resilience
- Contribute to security tooling deployments and maintenance to improve overall security posture
Requirements:
- 1+ years of experience building and maintaining reliable systems in a Linux/Unix environment
- 1+ years of experience with one or more infrastructure-as-code or configuration-as-code tools, such as Terraform, Chef, Ansible, or SaltStack
- 1+ years of experience writing automation code in at least one programming language, such as Python, Java, Golang, or Ruby
- 1+ years of experience with cloud platforms and an understanding of cloud security concepts
- 2+ years of experience with Amazon Web Services (AWS)
- Experience with cloud security tools such as WAF, IAM, AWS Config, or similar
- Experience building CI/CD pipelines using tools such as Jenkins, GitLab, or GitHub Actions
- Familiarity with security tooling such as CNAPP, CWPP, CSPM, IDS/IPS, or similar