Design, build, and operate highly available, scalable clusters supporting core data technologies including MongoDB, ElasticSearch, and Apache Kafka.
Own major platform components and deliver complex initiatives from design through to production.
Implement and enhance architectural patterns and platform standards for reliability, scalability, and performance.
Troubleshoot and resolve complex distributed systems issues across multi-cluster environments.
Build and maintain Infrastructure as Code and CI/CD pipelines to ensure repeatable, scalable deployments.
Contribute to observability, reliability, and operational excellence across the data platform estate.
Collaborate with engineering teams and Product Management to align platform capabilities with product needs.
Mentor junior engineers and contribute to knowledge sharing within the team.
Participate in on-call rotations to support platform reliability and continuous improvement.
Requirements
5+ years of experience in platform engineering, SRE, or infrastructure-focused roles.
Strong experience operating at least one of MongoDB, Kafka, or ElasticSearch in production environments, including day-2 operations.
Solid experience designing and operating Kubernetes environments and ecosystem tooling (e.g. Helm, ArgoCD) is a significant plus.
Proficiency in at least one programming language (Python, Java, or similar).
Experience with Infrastructure as Code tools such as Terraform.
Hands-on experience working with major cloud platforms (AWS preferred).
Experience implementing, maintaining and evolving observability solutions (e.g.Prometheus/Grafana or ELK).
Good understanding of security principles and experience embedding security best practices into production environments and code promotion systems.
Excellent collaboration and communication skills.
Strong problem-solving ability and attention to detail.
Tech Stack
Apache
AWS
Cloud
Distributed Systems
ElasticSearch
Grafana
Java
Kafka
Kubernetes
MongoDB
Prometheus
Python
Terraform
Benefits
Smarsh hires lifelong learners with a passion for innovating with purpose, humility and humor.
Collaboration is at the heart of everything we do.
We work closely with the most popular communications platforms and the world’s leading cloud infrastructure platforms.
We use the latest in AI/ML technology to help our customers break new ground at scale.
We are a global organization that values diversity, and we believe that providing opportunities for everyone to be their authentic self is key to our success.
Smarsh leadership, culture, and commitment to developing our people have all garnered Comparably.com Best Places to Work Awards.