10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs and leading global technology platforms. They are seeking an infrastructure-focused engineer to deploy, monitor, and scale a real-time ML-powered content moderation system, working closely with ML engineers and clients.

Responsibilities:

Design and maintain cloud infrastructure (GCP or AWS) to support real-time model serving, data ingestion, and evaluation workflows
Deploy and optimize APIs for low-latency access to ML models and embedding search systems
Manage and optimize the end-to-end training data flow—from sourcing and cleaning datasets to preparing them for model consumption—ensuring accuracy, scalability, and efficiency
Build observability tooling for production ML pipelines (monitor latency, error rates, request volumes, drift)
Automate model deployment, retraining, and evaluation pipelines (CI/CD for ML)
Work with ML engineers to package models for serving
Help manage vector databases and semantic search infrastructure (e.g., Pinecone, FAISS, Vertex Matching Engine)
Ensure security, compliance, and uptime of infrastructure supporting safety-critical systems

MLOps / Infrastructure Engineer

Key skills

About this role

Responsibilities: