Own core architecture across compute, networking, identity, and guardrails, delivered as code (Terraform/Terragrunt).
Architect, build, and maintain highly available and scalable infrastructure to support the OddsFactory engineering teams.
Drive secure-by-default and Zero Trust practices (least privilege, workload identity/OIDC, policy-as-code, supply-chain hardening).
Design and implement robust CI/CD pipelines, automation tools, and developer tooling to streamline deployments, improve code quality, and enhance team productivity.
Lead initiatives to enhance platform security, implementing best practices to ensure compliance and safeguard the integrity of our systems.
Champion infrastructure as code (IaC) practices, automating provisioning, configuration, and management of cloud-based environments.
Foster a culture of metrics-driven operational excellence, sharing knowledge, mentoring engineers, and promoting best practices across teams.
Proactively identify, troubleshoot, and resolve infrastructure, performance, and scalability issues, ensuring system reliability and optimal performance.
Stay abreast of emerging technologies and industry trends, advocating for adoption of new tools and techniques that align with our platform strategy.
Participate in an on-call rotation, ensuring platform stability and providing critical support for operational incidents.
Requirements
8+ years of proven experience in platform engineering, DevOps, or related infrastructure roles.
Strong software engineering background: you write clean code and have experience engineering solutions to complex problems.
Strong technical expertise in cloud-native environments (Azure preferred), infrastructure automation, and container orchestration (Service Fabric or Kubernetes).
Demonstrated proficiency in Infrastructure as Code systems such as Terraform, Azure Resource Manager, or CloudFormation.
Proficiency with programming and scripting languages (.NET & C# preferred; familiarity with Go or Python is a bonus).
Experience with observability tooling, chaos testing, and incident management.
Excellent influencing, problem-solving, and analytical skills, with demonstrated ability to partner closely with engineering teams.
Highly outcome-oriented, data-driven, and capable of balancing quality with productivity.
Strong communication skills, able to effectively collaborate across international teams.
Positive and flexible attitude, comfortable working in a fast-paced environment and embracing new initiatives.
Tech Stack
Azure
Cloud
Kubernetes
Python
Terraform
Go
.NET
Benefits
Occasional travel for essential offsite meetings, special events, or collaborative team sessions.