Temporal Technologies is an open source programming model company on a mission to simplify code and enhance developer experience. The Staff Developer Success Engineer will act as a technical expert for the developer community, helping users deploy and scale Temporal in cloud-native environments while troubleshooting complex infrastructure issues and optimizing performance.
Responsibilities:
- Be a Keen Learner: At Temporal, you’ll work with cloud-native, highly scalable infrastructure spanning AWS, GCP, Kubernetes, and microservices. You’ll gain deep expertise in container orchestration, networking, and observability while learning from complex, real-world customer use cases. Our stack includes Go, Python, and Java, providing continuous opportunities to hone your programming skills in infrastructure automation, resilience engineering, and performance tuning.
- Be a Passionate Problem Solver: If you enjoy tackling scalability, reliability, and troubleshooting challenges in distributed systems, you’ll thrive in this role. As a Staff Developer Success Engineer, you’ll work directly with developers to debug complex infrastructure issues, optimize cloud performance, and enhance reliability for Temporal users. You'll develop observability solutions (Grafana, Prometheus), improve networking (load balancing, DNS, ingress/egress), and automate infrastructure operations (Terraform, IaC) to help customers run Temporal efficiently at scale. Once ramped up, we expect you to independently drive technical solutions, whether debugging complex production issues or designing infrastructure best practices. Don't worry, we have seasoned engineers and mentors to support you along the way!
- Be a Great Communicator: As a Staff Developer Success Engineer, you will engage directly with developers, engineering teams, and product teams to understand infrastructure challenges and provide solutions that enhance scalability, performance, and reliability. Your insights will influence platform improvements, from enhancing observability tooling to developing self-service infrastructure solutions that simplify troubleshooting (e.g., building diagnostic tools similar to Twilio’s Network Test). You’ll serve as a bridge between developers and infrastructure, ensuring that reliability, performance, and developer experience remain top priorities as Temporal scales.
Requirements:
- 9+ years of experience as a developer, preferably fluent in one or more of the following languages: Python, Java, Golang, TypeScript
- Experience deploying and managing medium- to large-scale architectures (e.g., on Kubernetes or Docker)
- Experience with monitoring tools such as Prometheus and Grafana, and with troubleshooting performance and availability issues
- Minimum of one year of experience in an internal or external customer-facing role
- Passion for helping others regardless of who they are or how they act
- Experience working with or as part of remote teams
- Strong written and verbal communication skills
- Seek to understand first, lead with data, and rely on facts
- Previous experience in a customer-facing position such as professional services consultant, solutions architect, or customer engineer
- Experience with security certificate management and implementation
- Ability to understand use cases and translate them into Temporal design decisions and architecture best practices
- Technologies: EKS, GKE, Kubernetes, Prometheus, Grafana, OpenTracing, Terraform/Ansible/CDK