Owen Thomas is a B Corp™ company seeking a Senior Platform Engineer to support a global sports platform during the FIFA World Cup. The role involves monitoring and supporting live production systems, collaborating with engineering teams, and ensuring platform reliability during high-traffic events.
Responsibilities:
- Monitor and support live production systems during major sporting events
- Act as first-line engineering support across platform infrastructure and backend services
- Investigate and resolve production issues across AWS and distributed cloud environments
- Collaborate closely with the core engineering team to understand systems, tooling, and operational workflows
- Support backend services built primarily with TypeScript and Node.js
- Contribute to platform reliability, observability, alerting, and incident response processes
- Work across CI/CD pipelines, deployments, monitoring, and cloud infrastructure
- Debug application and infrastructure issues across containerised environments
- Assist with scaling, performance optimisation, and operational readiness during periods of peak traffic
- Document incidents, operational runbooks, and platform improvements clearly
- Use AI-native tooling such as Cursor, Claude Code, GitHub Copilot, or similar to improve engineering efficiency and troubleshooting workflows
Requirements:
- Strong commercial experience with Amazon Web Services (AWS) within production-scale environments
- Strong commercial experience in Platform Engineering, DevOps, or Site Reliability Engineering (SRE)
- Strong commercial experience with TypeScript and Node.js backend environments
- Strong commercial experience with production monitoring, observability, logging, and alerting systems
- Strong commercial experience with CI/CD pipelines and deployment tooling
- Strong commercial experience with containerised environments using Docker
- Strong commercial experience debugging and supporting distributed systems and APIs
- Strong commercial experience with performance optimisation, scaling, and reliability engineering
- Strong commercial experience with incident management and live production support
- Strong commercial experience with infrastructure and operational troubleshooting within cloud-native environments
- Strong commercial experience with AI-native development workflows using tools such as Cursor, Claude Code, GitHub Copilot, or similar
- Strong communication skills and the ability to operate independently
- Experience with C# or Python
- Experience supporting high-traffic consumer-facing platforms
- Experience within SportsTech, MediaTech, streaming, or real-time event-driven systems
- Exposure to load balancing, API gateways, and caching strategies
- Experience supporting globally distributed systems during live events
- Interest in football/soccer or live sports platforms