Phantom is a modern money app used by tens of millions around the world, combining tools for managing, spending, and growing money in one intuitive experience. The Staff SRE Engineer will manage Kubernetes clusters, implement infrastructure automation, optimize system performance, and collaborate with teams to support feature development and system scaling.
Responsibilities:
- Kubernetes Ownership: Manage and scale Kubernetes clusters on AWS EKS, ensuring reliability, performance, and security
- Infrastructure Automation: Implement and maintain Infrastructure-as-Code (Terraform/Pulumi) to automate infrastructure provisioning and management
- Performance Optimization: Monitor and optimize system performance, scalability, and resource utilization
- Blockchain Infrastructure: Configure and maintain crypto nodes across multiple blockchains to support our wallet’s operations
- Database Scaling: Optimize and scale database infrastructure to handle terabytes of blockchain data efficiently
- System Reliability: Continuously improve system uptime, monitoring, and observability using tools like Datadog and OpenTelemetry
- Collaboration: Work closely with backend and product teams to support feature development and system scaling
Requirements:
- 5+ years in an SRE or Software Engineer role
- Strong hands-on experience with Kubernetes (EKS) in production environments
- Proficiency with AWS infrastructure and services (EC2, S3, RDS, IAM)
- Solid experience with Docker and Infrastructure-as-Code tools like Terraform or Pulumi
- Monitoring and observability experience using tools like Datadog or OpenTelemetry