Temporal Technologies is on a mission to be the reliable foundation of every developer’s toolbox, and they are seeking a Staff Developer Success Engineer. This role will be the frontline technical expert for the developer community, focusing on deploying and scaling Temporal in cloud-native environments while troubleshooting complex infrastructure issues.
Responsibilities:
- Help users deploy and scale Temporal in cloud-native environments
- Troubleshoot complex infrastructure issues, optimize performance, and develop automation solutions
- Work with cloud-native, highly scalable infrastructure spanning AWS, GCP, Kubernetes, and microservices
- Gain deep expertise in container orchestration, networking, and observability while learning from complex, real-world customer use cases
- Debug complex infrastructure issues, optimize cloud performance, and enhance reliability for Temporal users
- Develop observability solutions (Grafana, Prometheus), improve networking (load balancing, DNS, ingress/egress), and automate infrastructure operations (Terraform, IaC)
- Independently drive technical solutions, whether debugging complex production issues or designing infrastructure best practices
- Engage directly with developers, engineering teams, and product teams to understand infrastructure challenges and provide solutions that enhance scalability, performance, and reliability
- Influence platform improvements, from enhancing observability tooling to developing self-service infrastructure solutions that simplify troubleshooting
- Serve as a bridge between developers and infrastructure, ensuring that reliability, performance, and developer experience remain top priorities as Temporal scales
Requirements:
- 9+ years of experience as a developer, preferably fluent in one or more of the following languages: Python, Java, Golang, TypeScript
- Experience with deployment and managing medium to large-scale architectures (e.g., Kubernetes or Docker)
- Experience with monitoring tools such as Prometheus and Grafana and troubleshooting performance and availability issues
- Minimum of one year experience in an internal or external customer-facing role
- Passion for helping others regardless of who they are or how they act
- Experience working with or as part of remote teams
- Strong written and verbal communication skills
- Seek to understand first, lead with data, and rely on facts
- Previous experience in customer-facing positions such as a professional services consultant, solutions architect, customer engineer, etc
- Experience with security certificate management and implementation
- Ability to understand use cases and translate them into Temporal design decisions and architecture best practices
- Technologies: EKS, GKE, Kubernetes, Prometheus, Grafana, OpenTracing, Terraform/Ansible/CDK