Infinity Constellation is a company focused on helping small-to-medium businesses hire their first AI employee. They are seeking a Staff/Principal Software Engineer to own and evolve the core platform that powers their AI employees, driving architectural decisions and ensuring its scalability and reliability.
Responsibilities:
- Drive platform architecture decisions and align the team on scalable patterns and long-term maintainability
- Review a high volume of code, design docs, and architectural proposals for scalability, reliability, security, and operability
- Be a technical mentor and force multiplier: unblock engineers, raise the bar on production readiness, and establish platform best practices
- Own and evolve the core backend platform (Django/DRF/ASGI), with responsibility for its performance and correctness
- Scale async execution across Celery + Dramatiq + Temporal/Cortex; implement resilient workflow patterns (retries, circuit breakers, graceful degradation)
- Optimize PostgreSQL/pgvector (query tuning, connection pooling) and caching strategies
- Maintain and improve Kubernetes deployment infrastructure (GKE, Helm, Terraform/OpenTofu) and CI/CD + rollout strategies
- Own KEDA autoscaling policies and resource allocation across worker pools
- Own reliability of RabbitMQ, Redis, and PostgreSQL infrastructure; lead incident response and post-mortems
- Extend OpenTelemetry + Datadog instrumentation, dashboards, alerts, and SLOs; profile and reduce latency/memory bottlenecks