Build, test, and deploy applications to deliver critical global communications services.
Identify and resolve issues related to latency, scalability, and performance.
Create tools and AI agents to automate revenue operations.
Investigate and resolve production issues by tracing through logs, data, and code to find root causes.
Requirements
5+ years of professional software development experience with an object-oriented programming language.
Experience with web application development using Python (FastAPI, aiohttp, or similar).
Experience designing and building RESTful APIs.
Experience with asynchronous programming.
Comfort working with PostgreSQL and relational databases at scale, including schema migrations, raw SQL queries, and query optimization.
Experience with event-driven architectures and message brokers (RabbitMQ, Kafka, or similar).
Experience in a highly available, uptime-critical production environment.
Deep knowledge of microservice architecture patterns and anti-patterns.
Familiarity with Docker, Kubernetes, Consul, GitHub Actions, Argo CD, Prometheus, or OpenTelemetry.
Strong troubleshooting skills — you can dig through logs, query databases, trace distributed systems, and read unfamiliar code to get to the bottom of an issue.
Comfortable owning services in production, including on-call responsibilities and incident resolution.
Comfortable using AI coding agents as part of your investigation workflow, while knowing when to rely on your own judgement.