Resilience is a cybersecurity company focused on integrating cyber insurance and risk management. The Senior DevOps Engineer will optimize cloud infrastructure operations, collaborate with teams to enhance development efficiency, and promote DevOps principles across the organization.
Responsibilities:
- Assist with streamlining the software development lifecycle by identifying pain points and productivity barriers, collaborating closely with stakeholders including development, quality assurance, and security to implement process improvements
- Collaborate with teams across the company to implement and maintain any infrastructure, tooling, or monitoring and alerting that is required to support their work
- Develop and maintain automation solutions and CI/CD pipelines to improve developer productivity and code quality
- Ensure that systems meet business and customer needs for reliability and availability
- Monitor and manage application performance and service quality, including working closely with stakeholders to troubleshoot production issues, identify root causes and issue resolution
- Promote DevOps principles and culture across the software engineering organization
Requirements:
- 5+ years of relevant DevOps experience
- Expertise designing, deploying, and managing scalable and resilient infrastructure on public cloud providers (AWS, GCP, Azure). Proficient in serverless container orchestration with AWS Fargate, Kubernetes or similar
- Expertise in defining and managing infrastructure using Terraform or similar tools
- Extensive experience in designing, developing, and optimizing complex CI/CD pipelines using tools like GitHub Actions, Travis, or Jenkins
- Experience implementing, maintaining, and optimizing build/test systems within a monorepo structure using advanced tooling like Nx, Pants, Bazel, or similar systems
- Experience implementing and maintaining logging, monitoring, and tracing (Observability) infrastructure (CloudWatch, Datadog, Prometheus, etc.)
- Proven experience driving effective team collaboration and technical alignment in an Agile environment
- Flexibility, adaptability, and a desire to learn new languages and technologies
- Strong verbal and written communication skills
- Familiar using Jira or other project management tools
- 4+ years of professional experience developing robust automation, tools, and services using Python or TypeScript/JavaScript
- Familiar with data engineering tools and platforms (RedShift, Snowflake, Airflow, Dagster, etc.)