TechClub Inc is seeking a highly skilled Site Reliability Engineer (SRE) to own the overall health, availability, performance, and resilience of their enterprise platform. The SRE will lead reliability engineering practices, manage infrastructure deployment pipelines, drive application deployments, and ensure timely remediation of security vulnerabilities.
Responsibilities:
- Own the overall health, availability, performance, and resilience of our enterprise platform
- Lead reliability engineering practices across the stack
- Manage infrastructure deployment pipelines using Terraform
- Drive application deployments through GitHub and Azure DevOps
- Ensure timely remediation of security vulnerabilities
- Implement world class observability using Dynatrace and Splunk
Requirements:
- Highly skilled Site Reliability Engineer (SRE)
- Ownership of overall health, availability, performance, and resilience of enterprise platform
- Experience with SQL Server
- Experience with .NET
- Experience with Java
- Experience with React.js
- Experience with Microservices
- Experience with Kafka
- Experience operating in a hybrid cloud environment on Azure and On Premises
- Lead reliability engineering practices across the stack
- Manage infrastructure deployment pipelines using Terraform
- Drive application deployments through GitHub and Azure DevOps
- Ensure timely remediation of security vulnerabilities
- Implement world class observability using Dynatrace and Splunk