Boulevard is a client experience platform for appointment-based self-care businesses, and they are seeking a Staff Database Reliability Engineer to enhance database reliability and scalability. This role involves leading initiatives for robust database platforms, optimizing performance, and mentoring engineering teams to foster a culture of reliability.
Responsibilities:
- Develop a deep understanding of how Boulevard’s systems behave, scale, interact, and fail, and use that insight to identify risks and improvement opportunities
- Own and improve database reliability, performance, and scalability; participate in incident response and drive architectural improvements that reduce incident frequency and impact
- Partner with engineering teams to design, build, and operate scalable, fault-tolerant, and secure distributed systems that support Boulevard’s growth and customer trust
- Build tools, automation, and frameworks that eliminate toil, reduce operational overhead, and establish best practices used across engineering teams
- Elevate observability and operational excellence through actionable metrics, alerts, and dashboards that enable faster incident resolution and proactive reliability improvements
- Mentor and influence engineers across the organization, helping foster a culture where reliability is a shared responsibility
Requirements:
- 8–10+ years of experience in systems, infrastructure, or backend software engineering, with a strong focus on RDBMS and NoSQL systems
- Production experience with managed cloud databases such as AWS Aurora/RDS (PostgreSQL), and deploying/managing infrastructure using infrastructure-as-code tools
- Proven experience delivering reliability outcomes using SLOs, SLIs, error budgets, and mature observability practices
- Strong background in automation, scripting, and infrastructure-as-code (e.g., Terraform, Python, Go, or similar)
- Experience diagnosing and mitigating production incidents in high-availability systems, with a focus on learning and continuous improvement
- Excellent communication skills and the ability to influence without authority across engineering teams
- Demonstrated ability to set technical standards, mentor engineers, and scale impact through others
- Ability to navigate uncertainty, set direction, and iterate toward meaningful outcomes in a fast-moving environment
- Experience with Elixir, Phoenix, Ruby, or Rails
- Hands-on experience identifying and improving database performance