Tulip is a leader in AI-native frontline operations, helping companies enhance their workforce through connected apps. The Senior DevOps Engineer will own the deployment and continuous improvement of Tulip's multi-cloud environments, ensuring stability and performance while driving automation and collaboration with application engineering teams.
Responsibilities:
- Own the deployment, health, and continuous improvement of Tulip's multi-cloud, multi-region SaaS environments — including clusters spanning the US, Europe, and Asia
- Design and evolve cloud architecture to ensure customer availability, stability, and performance as Tulip scales globally
- Own and continuously improve Tulip's CI/CD infrastructure, driving toward a fully automated, human-interaction-free software delivery lifecycle
- Build automation tooling and internal systems that reduce operational toil and increase developer velocity — if it can be automated, automate it
- Define and maintain observability standards across Tulip's cloud environments, including metrics, alerting, logging, and distributed tracing
- Proactively identify performance degradation and capacity risks before they impact customers; lead incident response and drive root cause analysis
- Serve as a close partner to application engineering teams throughout the software development lifecycle, providing infrastructure guidance and support
- Participate in the on-call rotation and contribute to a culture of continuous improvement through documentation, runbooks, and process iteration
Requirements:
- United States Citizenship due to the nature of the assignments
- 5-7+ years of hands-on DevOps or Infrastructure Engineering experience, with demonstrated ownership of production cloud environments at scale
- Proficiency with modern cloud infrastructure tooling — experience with Kubernetes, Helm, Terraform, Ansible, and major cloud providers (AWS and/or Azure) is highly relevant
- Experience managing enterprise-grade data persistence layers, including NoSQL and SQL databases, key/value stores, and messaging systems (e.g., AMQP, MQTT)
- Familiarity with observability and monitoring tooling (e.g., Prometheus, Mimir, Thanos, Grafana) and a strong understanding of what good SRE practice looks like in a fast-growing SaaS environment
- Exposure to modern programming or scripting languages used in infrastructure contexts (e.g., Go, TypeScript, Python, Bash)
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience