Zillow is a leading real estate platform dedicated to transforming the home buying experience through AI technology. The Principal Machine Learning Engineer will develop advanced multimodal foundational technologies, mentor engineers, and drive product innovation in agentic AI solutions.
Responsibilities:
- Leverage frameworks like AgentSDK, and LangChain/LangGraph to design, prototype, and develop multi-agent systems that are capable of highly autonomous and context-aware interactions
- Leverage advanced GenAI models including reasoning models, real-time voice API, etc, to build agentic prototypes and later on convert them into product-level agentic skills and deploy to users
- Have the mentality of “build, learn, and pivot”, and hold the high bar of shipping into production
- Mentor and guide engineers in using the right technologies for agentic AI solutions and foster a culture of innovation and responsible AI usage
- Distill complex research findings and system designs into actionable insights for diverse audiences—including executives
- Remain on the cutting edge of agentic AI emerging paradigms, driving product innovation
- Serve as the focal point for applied science projects, driving alignment on timelines, and prioritization
- Continuously refine processes for experimentation, A/B testing, and production rollouts to balance rapid innovation with reliability and responsible deployment
Requirements:
- A master's degree or above, equivalent experience in Computer Science, Electrical Engineering, or a related field with emphasis on foundational LLM, agentic AI, reinforcement learning, AI planning, or natural language processing
- 7+ years of hands-on work building large-scale, high-impact solutions—ideally in the most recent two years building agent-based systems, multi-agent collaboration, or similar paradigms
- Experience developing dialogue systems capable of long conversations, multi-step reasoning, context-rich decision-making
- Experience deploying and scaling AI services capable of handling hundreds of millions of daily interactions with high availability, low latency, and robust fault tolerance
- A track record of publishing high-impact research in top AI/ML venues is a big plus