Supio is looking for a hands-on Site Reliability Engineer to help shape and scale the reliability layer of their stack. This role involves owning the release pipeline, managing deployments, and automating infrastructure while working closely with engineers and product leads to ensure uptime and confidence in every deploy.
Responsibilities:
- Own Deployments: Lead our release and deployment process — from daily rollouts to weekly deploys and hotfix coordination. Build safe, repeatable, and observable workflows
- GitHub Operations: Manage GitHub branching strategies, pull request flows, merge policies, and GitHub Actions. Set and enforce collaboration standards for the engineering team
- Infrastructure & Monitoring: Build and maintain resilient AWS-based infrastructure. Set up and manage observability tools (logs, metrics, traces), configure alarms, and be the first responder for incidents. Triage, escalate, or resolve based on impact
- Automation & Internal Tooling: Write scripts, services, and automations that reduce friction and improve deployment confidence. Using AI tools to generate code is encouraged and expected — you'll be comfortable guiding, adapting, and integrating AI-assisted outputs into production workflows
- Software Development: You’ll contribute code when needed — whether that’s building internal tools, improving system reliability, or unblocking a deploy. This is not a sprint-based role, but strong software fundamentals are key to success
- Support Global Teams: Work off-hours as needed to unblock offshore teams and maintain deployment velocity across time zones
Requirements:
- 3–6+ years in SRE, DevOps, or infrastructure roles with production ownership
- Started your career in software development — and still enjoy writing code
- Fluent in or at least familiar with Bash, Python, TypeScript, and Postgres SQL
- Confident AWS operator and know your way around EC2, Lambda, RDS, IAM, and VPCs
- Strong experience with GitHub workflows, including GitHub Actions and release automation
- Comfortable using AI tools (Claude, ChatGPT, etc.) to generate code — and have the skill to audit and adapt that code to meet production standards
- Familiar with CI/CD principles and enjoy owning the full deployment lifecycle
- Comfortable being on-call and understand how to design systems for both speed and safety
- Can operate with a high level of autonomy in fast-moving, ambiguous environments