Sequoia Capital Global Equities is associated with Reddit, a community platform that hosts conversations on various topics. They are seeking a Staff Machine Learning Engineer to lead the Commercial Content Understanding roadmap for the Monetization organization and act as the technical owner for ACU’s signals and ML systems.
Responsibilities:
- Provide technical leadership and mentorship to MLEs and SWEs doing ML work in ACU, acting as de facto tech lead for content understanding and signals: driving design reviews, setting technical standards, and uplifting the team’s modeling and systems craft
- Develop evaluation systems and quality monitoring systems for content understanding signals, using SOTA LM-judge practices
- Drive operational excellence for ACU’s ML systems by defining SLOs, alerting, and dashboards for key signals (coverage, latency, precision/recall, cost)
- Build and evolve content understanding capabilities for commercial conversations (e.g., reviews vs. recommendations vs. comparisons vs. Q&A; sentiment and stance; product entities and categories) and operationalize them as robust signals that power contextual and shopping ads, auto-targeting, new formats, and insights products
- Lead design and implementation of signals pipelines and produce an ACU signals registry. Partner with platform teams and other content understanding teams to ensure efficient, reliable serving at Reddit scale
- Drive LLM and modern ML best practices within ACU: define when to prompt, finetune, or distill; design evaluation and safety harnesses; and lead at least one major distillation effort to replace external APIs with in-house models
- Operate across the full ML lifecycle (problem framing, data, modeling, evaluation, deployment, monitoring, and oncall), designing scalable, resilient MLOps pipelines and championing responsible AI (bias, safety, explainability) for ACU’s models and signals in production
Requirements:
- 7+ years of relevant MLE experience delivering production ML systems (models + pipelines + serving) at scale, ideally in large-scale content understanding domains, or Ads
- Demonstrated Staff-level technical leadership: has driven architecture decisions, standards, and design reviews across multiple teams, and has aligned PMs, DSs, and engineers on shared ML systems or platforms without direct people-management authority
- Excellent communication skills, with the ability to explain complex technical trade-offs to PMs, DSs, and other engineering teams, especially in ambiguous, cross-team problem spaces like Seekers/Searchers monetization
- Strong track record building and shipping NLP / Language models / content understanding models to production (e.g., classifiers, encoders, sequence or session models), with clear business outcomes (e.g., CTR/ROAS uplift, safety improvements). Experience with commercial or intent modeling is a strong plus
- Practical experience using LLMs in production for labeling, evaluation, or distillation (e.g., LM-as-judge, prompt-based classifiers, LLM-generated labels distilled into smaller models), including managing quality, cost, and latency trade-offs
- Deep experience with PyTorch, TensorFlow, or similar, and production-quality code in Python (and ideally one statically typed language like Go/Java/C++). Comfortable owning training, evaluation, and deployment code end-to-end
- Experience designing ML systems and pipelines: offline training, feature pipelines (batch/streaming), online serving, monitoring, and experimentation for high-traffic surfaces