Design, develop, and operate backend services that power Arbiter's core platform, with an emphasis on reliability, modularity, and clean system boundaries.
Build and maintain robust connections to third-party systems (e.g. cloud APIs, AI services, data exchange services, EHRs, telephony platforms). Own the abstractions that make these integrations reusable and adaptable across customers with minimal rework.
Design and operate high-scale APIs (REST, gRPC, webhooks) and manage complex data flows including real-time streaming, batch processing, file-based exchange (e.g. SFTP, HL7, EDI), and event-driven pipelines.
Ensure high throughput, low latency, and fault tolerance across backend services through strong system design, monitoring, alerting, and operational best practices. Handle vendor failures, retries, idempotency, and graceful degradation.
Build and maintain ETL/ELT pipelines, manage schema evolution, and ensure data quality and integrity across systems with varying formats, standards, and reliability.
Implement and uphold best practices for CI/CD, testing, observability, and deployment of backend systems in production cloud environments.
Partner closely with AI engineers, product managers, implementation teams, and customer stakeholders to translate ambiguous, high-impact problems into scalable technical solutions.
Mentor engineers, contribute to internal documentation and standards, influence technical direction, and raise the overall engineering bar.
Take end-to-end ownership of critical systems, including participating in on-call rotations and leading incident resolution when production issues arise.
Requirements
5+ years of hands-on experience building and operating production backend systems in high-availability environments.
Computer Science or Engineering degree, or equivalent practical experience.
Experience building and maintaining large-scale Python codebases with strong opinions on structure, quality, and tradeoffs.
Deep understanding of API design patterns, versioning, backward compatibility, and managing breaking changes across consumers.
Experience building reusable abstraction layers or connector frameworks that allow a single integration pattern to serve multiple customers or vendors.
Proven experience designing systems that connect to third-party services, including handling authentication, rate limiting, retry logic, and failure modes gracefully.
Strong understanding of concurrency, scalability, reliability, and distributed systems patterns.
Hands-on experience with data pipeline architectures: batch and streaming, schema management, and data quality enforcement.
Experience with cloud infrastructure (AWS, GCP, or Azure) and production deployments.
Strong communication skills and ability to work effectively across functions.
Proficiency with AI-assisted development tools (e.g., Cursor, Claude Code, GitHub Copilot).
Track record of delivering complex systems end-to-end with minimal oversight.
Tech Stack
AWS
Azure
Cloud
Distributed Systems
ETL
Google Cloud Platform
GRPC
Python
Benefits
Highly Competitive Salary & Equity Package: Designed to rival top FAANG compensation, including meaningful equity.
Generous Paid Time Off (PTO): To ensure a healthy work-life balance.
Comprehensive Health, Vision, and Dental Insurance: Robust coverage for you and your family.
Life and Disability Insurance: Providing financial security.
Simple IRA Matching: To support your long-term financial goals.
Professional Development Budget: Support for conferences, courses, and certifications to fuel your continuous learning.
Wellness Programs: Initiatives to support your physical and mental health.