Hiive is a fintech/marketplace startup recognized as one of Canada’s top startups, connecting buyers and sellers of stock in venture-backed pre-IPO companies. As the Associate Director of Engineering, Platform, you will lead the DevOps and Site Reliability Engineering teams, focusing on operational success, technical strategy, and team growth.
Responsibilities:
- Leading, mentoring, and growing teams across DevOps and SRE, fostering collaboration, innovation, and continuous improvement
- Overseeing the platform engineering teams, ensuring alignment with organizational goals and maintaining high performance across all initiatives
- Conducting regular performance evaluations, providing actionable feedback, and supporting professional development and career progression
- Managing hiring, onboarding, and scaling efforts as the team and organization grow
- Assisting in scaling the organization by leading hiring efforts and helping to identify improvements in processes and workflows
- Guiding technical architecture decisions related to platform, infrastructure, automation, and observability, ensuring solutions are scalable, maintainable, and align with industry best practices
- Reviewing technical designs, providing expert insights for improvement
- Proactively identifying gaps and advocating for best practices in reliability, security, automation, and developer experience, and advocate for effective solutions
- Establishing and upholding standards for platform stability, reliability, quality, performance, and security, ensuring robust monitoring, validation, and response practices
- Championing continuous improvements to developer productivity and platform usability, supporting product engineering teams in achieving their best work
- Owning incident response processes, including resolution, post-mortems, root-cause analysis, and continuous improvement activities
Requirements:
- Experience in senior leadership roles managing DevOps and SRE
- Previous work experience at a scaling startup
- Proven experience in platform architecture, site reliability engineering, automation, and DevOps methodologies
- Demonstrated success in defining platform strategy, architecture, reliability, and automation
- Strong people management skills with proven ability to mentor, grow, and lead engineering teams
- Excellent communication skills, with the ability to clearly engage both technical teams and non-technical stakeholders across product and engineering
- Experience with Amazon Web Services (AWS), particularly with strong proficiency in EKS, RDS, and VPC
- Experience with Kubernetes, Terraform, GitHub Actions for CI/CD pipelines, and Datadog for observability tooling
- Previous experience working with and optimizing PostgreSQL
- Experience with Cloudflare and Vercel
- Experience with Playwright or similar test automation frameworks
- Strong understanding of security best practices in platform engineering
- Experience in regulated or high-compliance environments
- Experience completing SOC2 or similar certifications
- Experience working with Elixir