Design, build, and maintain application deployment, packaging, and operational tooling in support of the Core Applications team, spanning both SaaS and on-premise delivery models.
Enhance application-level observability and reliability signals, including monitoring, alerting, and SLOs, in partnership with the delivery teams.
Collaborate closely with application engineers and the delivery teams to define, measure, and improve performance, deployment patterns, and operational readiness across application and database layers.
Help enforce and improve security and compliance standards, including SOC 2 controls.
Contribute to documentation, onboarding materials, and internal support processes.
Participate as a secondary or escalation support resource as needed
Requirements
5+ years of experience in DevOps, SRE, or platform engineering roles.
2+ years of experience at a B2B software startup.
Hands-on experience deploying and supporting Django applications in production, including application servers, background jobs, and migrations.
Strong experience operating Postgres-backed systems at scale, including schema design implications, migrations, performance analysis, and reliability considerations.
Demonstrated ability to influence application performance beyond infrastructure tuning, working with engineers on query patterns, caching strategies, and architectural tradeoffs.
Strong experience with CI/CD pipelines and automation tooling—ideally GitHub Actions.
Strong experience with AWS (EC2, VPC, IAM, RDS, etc.), ideally EKS/Kubernetes and infrastructure-as-code (Terraform, Helm).
Familiarity with observability tools like Prometheus, Grafana, Mimir, Loki, or similar.
Proficiency in Python, Go, or shell scripting.
Comfortable operating in a fast-paced, ambiguous startup environment.