Coralogix is a modern, full-stack observability platform transforming how businesses process and understand their data. They are looking for a Site Reliability Engineer to work as part of their Cloud Infrastructure Team, focusing on Enterprise FedRAMP Cloud Infrastructure and enhancing operational efficiency.
Responsibilities:
- Work in high scale environments - Coralogix data pipeline processes 55Tb of data each day
- Adopt cutting edge technologies with end-to-end responsibility
- Building internal tools to expand our platform capabilities
- Collaborate with R&D to improve stability & reliability of the system
- Lead the product roadmap - our product is designed for engineers. Therefore, our engineers promote, enhance, and take a crucial part in influencing the product roadmap
- Perform operational duties for FedRAMP cloud products, including deployments, on-call support, and incident management
Requirements:
- At least 5 years of experience as a DevOps Engineer/ SRE in production environments
- In-depth experience with Kubernetes - operating & monitoring are key parts
- At least 2 years of experience with FedRAMP compliance (High/Moderate levels), vulnerability management, and continuous monitoring, including scanning, patching, and reporting
- High familiarity with monitoring tools such as Coralogix, Grafana, Prometheus
- Experience in AWS or other cloud providers
- Experience with infrastructure as a code (Terraform, Crossplane, etc.)
- Understanding of networking - from networking layers to different networking protocols (http, grpc, ssl)
- Some software engineering experience, preferably in Golang
- Operating data pipelines
- Familiarity with Apache Kafka