Operate and optimize Kubernetes clusters, Istio service mesh, and Linux-based systems
Automate workflows using Go, Python, and Shell scripting
Build monitoring and observability solutions with Prometheus, Grafana, and Loki
Troubleshoot complex networking, storage, and system performance issues
Participate in on-call rotations and postmortem reviews to improve system resilience
Partner with AI/ML teams to ensure infrastructure readiness for model training and data pipelines
Requirements
Experience with Google Cloud and infrastructure-as-code (IaC) tools such as Terraform
Strong knowledge of microservices, containers and orchestration (Docker, Kubernetes), and networking
SRE mindset with a focus on automation, scalability, and reliability
Hands-on experience with PKI, service mesh, and Linux systems administration
Tech Stack
Cloud
Docker
Grafana
Kubernetes
Linux
Microservices
Prometheus
Python
Shell Scripting
Terraform
Go
Benefits
Competitive total rewards package
Blog during work hours; take a day off and volunteer for your favorite charity
Work remotely from home with no daily commute to an office; all you need is a stable internet connection
Collaborate with some of the best and brightest in the industry!
Hone your skills or learn new ones with our substantial training allowance: participate in professional development days, attend training, or become certified
We give you all the equipment you need to work from home, including a laptop with your choice of OS and an annual budget to personalize your work environment
Pythian cares about the health and well-being of our team: you will have an annual wellness budget to make yourself a priority (use it on gym memberships, massages, fitness, and more)
A generous number of paid vacation and sick days