Own major parts of the Workflow Engine architecture and help drive how it evolves as the product and customer base scale.
Design, build, and operate backend services for workflow ingestion, orchestration, execution, and observability using TypeScript (NestJS), Postgres, Redis, Kafka, and Kubernetes.
Build systems that process external and internal events, evaluate workflow triggers and logic, and execute multi-step workflows reliably across distributed infrastructure.
Improve how we manage workflow state, including long-running executions, step coordination, cancellations, replay, and auditability.
Raise the bar on reliability and observability: logging, metrics, tracing, dashboards, alerting, and tooling that helps both engineers and customers understand what happened in a workflow run.
Partner closely with product and design to shape workflow capabilities, execution behavior, and developer-facing abstractions that are both powerful and understandable.
Requirements
6+ years of experience building and operating backend systems in production, ideally with TypeScript / Node.js (NestJS or a similar framework).
Strong experience building distributed systems and reasoning clearly about reliability, correctness, and tradeoffs in production environments.
Hands-on experience with event-driven architectures and technologies such as Kafka.
Strong practical experience building asynchronous processing systems.
Deep experience with Postgres.
Strong practical experience using Redis for caching, coordination, locking, rate limiting, or transient execution state.
Solid experience shipping and debugging services on Kubernetes in a cloud environment.
Experience building multi-tenant systems with strong reliability and isolation guarantees.