Zillow is a leading real estate platform that is transforming the industry through AI technologies. They are seeking a Principal Machine Learning Engineer to develop advanced multimodal foundational technologies and drive product innovation by prototyping and deploying AI models within their scalable infrastructure.
Responsibilities:
- Leverage frameworks like AgentSDK, and LangChain/LangGraph to design, prototype, and develop multi-agent systems that are capable of highly autonomous and context-aware interactions
- Leverage advanced GenAI models including reasoning models, real-time voice API, etc, to build agentic prototypes and later on convert them into product-level agentic skills and deploy to users
- Have the mentality of “build, learn, and pivot”, and hold the high bar of shipping into production
- Mentor and guide engineers in using the right technologies for agentic AI solutions and foster a culture of innovation and responsible AI usage
- Distill complex research findings and system designs into actionable insights for diverse audiences—including executives
- Remain on the cutting edge of agentic AI emerging paradigms, driving product innovation
- Serve as the focal point for applied science projects, driving alignment on timelines, and prioritization
- Continuously refine processes for experimentation, A/B testing, and production rollouts to balance rapid innovation with reliability and responsible deployment
Requirements:
- A master's degree or above, equivalent experience in Computer Science, Electrical Engineering, or a related field with emphasis on foundational LLM, agentic AI, reinforcement learning, AI planning, or natural language processing
- 7+ years of hands-on work building large-scale, high-impact solutions—ideally in the most recent two years building agent-based systems, multi-agent collaboration, or similar paradigms
- Experience developing dialogue systems capable of long conversations, multi-step reasoning, context-rich decision-making
- Experience deploying and scaling AI services capable of handling hundreds of millions of daily interactions with high availability, low latency, and robust fault tolerance
- A track record of publishing high-impact research in top AI/ML venues is a big plus