Instacart is transforming the grocery industry by providing essential services that customers rely on for grocery delivery. They are seeking a Senior Machine Learning Engineer II to lead the design and development of core machine learning models for their ads ecosystem, focusing on improving model calibration and addressing training data biases.
Responsibilities:
- Lead research and development of pCTR and conversion prediction models, with a focus on improving calibration, reducing training data biases (selection bias, position bias, optimizer’s curse), and advancing model accuracy across Instacart’s ads surfaces
- Design and implement debiasing techniques such as Mixed Negative Sampling (MNS), Inverse Propensity Weighting (IPW), counterfactual risk minimization, and calibration methods (Platt scaling, isotonic regression) to address systematic prediction biases
- Contribute to the next-generation Multi-Domain Multi-Task (MDMT) model architecture, incorporating innovations like Mixture-of-Experts (MoE), Transformer layers for sequential user behavior, and LoRA adaptors for scalable domain fine-tuning
- Drive sequence modeling initiatives including the TIGER generative retrieval system and Semantic ID representation learning, expanding their application across ads surfaces such as Product Details, Search and other placements
- Collaborate with the broader ML community in the company on the path toward Foundation Models using autoregressive user behavior prediction
- Formulate and scope ambiguous modeling problems from first principles. Translate business observations (e.g., overcalibration patterns, cold-start underperformance) into well-defined ML research directions with clear evaluation criteria
- Publish and present findings internally. Contribute to the team’s culture of technical rigor through design reviews, paper sharing, and experiment retrospectives
Requirements:
- PhD/Master in machine learning, statistics, computer science, information retrieval, or a closely related quantitative field
- 6+ years of combined academic and industry experience (including PhD research) applying ML to ranking, recommendation, or prediction problems at scale
- Deep understanding of CTR/conversion prediction modeling, including familiarity with architectures such as Deep & Wide, DeepFM, DCN, and multi-task learning formulations
- Strong foundation in causal inference, counterfactual reasoning, and training data bias mitigation
- Ability to reason about selection bias, position bias, and propensity-based correction methods
- Proficiency in Python and deep learning frameworks (PyTorch, Tensorflow, JAX)
- Fluency in data manipulation tools (SQL, Spark, Pandas)
- Track record of formulating ambiguous problems into well-scoped ML research directions and delivering results through rigorous experimentation
- Strong written and verbal communication skills
- Ability to explain complex modeling decisions to cross-functional stakeholders including product managers and data scientists
- Experience in ads ranking or auction-based systems (pCTR, bid optimization, ROAS feedback loops, marketplace dynamics)
- Hands-on experience with autoregressive sequence models for user behavior prediction, generative retrieval, or transformer-based ranking architectures
- Familiarity with learned representations such as Semantic IDs, product embeddings, or other approaches to reducing feature cardinality and cold-start challenges
- Experience with transfer learning or domain adaptation techniques (e.g., LoRA, adapter-based fine-tuning) applied to recommendation or ranking models
- Publication record in top-tier venues (KDD, WWW, RecSys, NeurIPS, ICML, SIGIR, or similar)
- Experience mentoring junior engineers or shaping technical direction for a modeling team
- Familiarity with LLM-driven approaches to recommendation, including prompt-based personalization and AI-assisted model development (AutoML)