Adobe is a leading company that empowers creativity through innovative platforms and tools. They are seeking a Senior Site Reliability Engineer to enhance their Ethos Network Reliability team, focusing on building and supporting a container-based platform while ensuring network reliability and performance at scale.
Responsibilities:
- Work directly with an experienced team of engineers to learn about our systems and make meaningful contributions
- Enhance monitoring and alerting for Adobe’s API Gateway and cluster networking infrastructure
- Lead and participate in network debugging and triage during CSO events to identify root causes of latency and errors
- Propose and drive network optimizations for cross‑cloud communication across AWS and Azure
- Develop infrastructure-level components that power Adobe’s compute environments
- Use and contribute to numerous open-source tools and projects like Kubernetes and Envoy
- Operate in large-scale, cloud-based environments like AWS and Azure
- Work as part of a fast-paced, agile process with the freedom to work independently
Requirements:
- B.Sc. or higher in a related field, or equivalent experience
- 5+ years of proven experience in software engineering, site reliability engineering, release engineering, and/or configuration management
- Experience with cloud service providers: Microsoft Azure and Amazon Web Services (AWS)
- Familiarity with scripting, CI/CD systems, Linux, networking, containerization, REST-based APIs, Kubernetes and API gateways
- Experience with Python, Golang, or a similar programming language
- An ability to coordinate and approve technical requirements documents, code reviews, service deployments and project plans
- Highly organized and able to balance multiple communication channels, schedules and meetings
- Strong written and oral communication skills
- A schedule that allows you to meet regularly between 7 AM and 10 AM Pacific Time, or 9 AM and noon Eastern Time