About this role
Role Overview
Support the Platform Infrastructure
Help manage and scale our container environment on Amazon EKS, implement GitOps workflows using ArgoCD, and maintain CI/CD pipelines through GitHub Actions to ensure that deployments are fast, consistent, and automated.
Build for Reliability
Define and track SLIs and SLOs, lead incident response including on-call rotations, root cause analysis, and post-mortems, and contribute to disaster recovery planning to keep our systems highly available.
Drive Observability
Design and maintain our monitoring and logging stack using Datadog, Sentry, and CloudWatch — giving engineering teams clear visibility into system health and performance before problems reach users.
Shape the Platform's Future
Collaborate on architectural decisions, build internal tooling and self-service workflows that make the platform easier to operate, and contribute meaningfully to how we scale and evolve our infrastructure.
Requirements
3+ years of experience in SRE, DevOps, or cloud infrastructure.
Confident working with core AWS services (VPC, IAM, EKS, RDS) and a strong understanding of cloud networking and security best practices.
Expert in infrastructure as code using Terraform, CloudFormation, or Crossplane.
Proficient with GitHub and GitHub Actions as a core component of your CI/CD and automation pipelines, not just for source control.
Experienced in running Kubernetes clusters in production and managing application deployments through GitOps workflows (ArgoCD/Flux) and Helm charts.
Proficient with observability tooling such as Datadog, Sentry, CloudWatch, and Grafana, including building alerts, dashboards, and log pipelines.
Experience writing solid Python scripts to glue systems together, automate infrastructure tasks, or handle custom workflows.
Comfortable working independently in a remote setup, asking questions when needed, and keeping momentum without being micromanaged.
Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
Nice to have: certifications in AWS, Kubernetes, Terraform, or Python.
Tech Stack
AWS
Cloud
Flux
Grafana
Kubernetes
Python
Terraform
Benefits
Competitive pay with equity options
Stellar health care plan options (Medical, Dental & Vision), with FSA, DCFSA, & HSA options
Company-sponsored disability & life insurance
Unlimited PTO
401(k) + 4% Matching
Fully remote work + flexible working hours
$750 work-from-home setup budget
Paid biannual in-person company summits
Quarterly $150 co-hanging stipend to meet up with coworkers