10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs and leading global technology platforms. They are seeking an infrastructure-focused engineer to deploy, monitor, and scale a real-time ML-powered content moderation system, working closely with ML engineers and clients.
Responsibilities:
- Design and maintain cloud infrastructure (GCP or AWS) to support real-time model serving, data ingestion, and evaluation workflows
- Deploy and optimize APIs for low-latency access to ML models and embedding search systems
- Manage and optimize the end-to-end training data flow—from sourcing and cleaning datasets to preparing them for model consumption—ensuring accuracy, scalability, and efficiency
- Build observability tooling for production ML pipelines (monitor latency, error rates, request volumes, drift)
- Automate model deployment, retraining, and evaluation pipelines (CI/CD for ML)
- Work with ML engineers to package models for serving
- Help manage vector databases and semantic search infrastructure (e.g., Pinecone, FAISS, Vertex Matching Engine)
- Ensure security, compliance, and uptime of infrastructure supporting safety-critical systems