Inktavo and OrderMyGear are modernizing the apparel and promotional product decoration industry. They are seeking a Senior Site Reliability & Platform Engineer to design and maintain cloud environments, implement observability, and build automated processes to enhance platform reliability and security.
Responsibilities:
- Design and maintain our core Kubernetes and Cloud Native environments within GCP, AWS, and Azure, ensuring high availability, scalability, security, and seamless deployment patterns
- Implement a comprehensive observability stack to provide deep insights into system health, performance, and security posture
- Provide expertise in integrating and bridging legacy or specialized workloads in Azure and AWS
- Build automated, repeatable processes for provisioning and deprovisioning infrastructure, reducing manual toil to near zero
- Develop self-service tools that empower DevOps and Engineering teams to manage their own tool configurations while remaining compliant with MergeCo security standards
Requirements:
- 5+ years in SRE, DevOps, or Platform Engineering roles
- Expert-level Kubernetes orchestration and containerization (Docker/Containerd)
- GCP Professional Cloud Architect or equivalent experience (IAM, VPCs, GKE, Cloud Operations)
- Deep proficiency in Terraform, CDK, or Pulumi
- Experience with Prometheus, Grafana, ELK, or Datadog to drive SLIs/SLOs
- Proficiency in Go, Python, or similar for tooling and automation
- Familiarity with Azure/AWS for hybrid-cloud connectivity and migrations
- Experience navigating complex multi-cloud environments