Block is a company focused on economic empowerment and is seeking a Senior Site Reliability Engineer to enhance the reliability of its platform and infrastructure. The role involves improving observability and incident response using AI-driven tooling, leading incident command, and driving reliability improvements across the organization.
Responsibilities:
- Build and extend platforms to improve system reliability
- Work on team goals that encompass reliability for the entire company
- Standardize reliability tools across multiple platforms and organizations
- Triage, coordinate, and lead stabilization of SEV0 and SEV1 incidents
- Serve as primary oncall, maintaining structured escalation paths and escalating to leadership when needed
- Drive platform-wide reliability improvements, shared operational tooling, and deploy-safety patterns
- Use AI-driven systems to improve signal detection, reduce noise, and accelerate root cause analysis
- Design and implement safe deployment patterns (progressive delivery, automated rollback, guardrails)
Requirements:
- Ability to drive to the root cause of problems in systems with many moving parts and take the necessary steps to fix them
- Demonstrated technical initiative and leadership on previous projects, especially those with a backend/platform focus
- Familiarity with AI-driven tooling for observability, incident analysis, or automation
- A mindset that naturally reaches for AI to accelerate problem-solving and reduce toil
- Experience running production oncall for high-availability systems
- Strong incident management skills — structured triage, mitigation under pressure, blameless postmortems
- Fluency with CI/CD pipelines, progressive rollout strategies, and rollback automation
- Monitoring & observability expertise — building/tuning alerts for uptime, error rates, latency regression, and resource exhaustion
- Ability to create and maintain evidence-based maturity assessments using trailing 90-day data windows
- Comfort with vendor/dependency management, including maintaining validated escalation contacts reachable within 5 minutes
- Boundless curiosity, autonomy, and a strong sense of accountability
- A strong desire to perform and grow as an engineer
- 5+ years of software development experience