Serko, a cutting-edge platform in global business travel and expense technology, is seeking a Principal Engineer to serve as the technical expert for its AI Platform & Operations. The role focuses on defining the technical vision for the foundational systems that support AI product teams, ensuring that AI models are deployed reliably and efficiently.
Responsibilities:
- Architect the Future: Define the long-term technical roadmap for our AI platform, covering everything from model serving and feature stores to experiment tracking and CI/CD for ML
- Set the Gold Standard: Establish engineering benchmarks for deployment, versioning, A/B testing, and automated rollbacks
- Optimize & Scale: Lead strategies for GPU/compute efficiency and cost optimization while managing the complexities of LLM inference (quantization, batching, and latency) at scale
- Enhance Observability: Design monitoring and alerting systems tailored to AI workloads in production
- Champion Reliability: Drive platform stability and partner with application teams to ensure our infrastructure meets evolving product needs
- Technical Leadership: Mentor senior engineers, lead architecture reviews, and evaluate the next generation of cloud services and ML frameworks