Reddit, Inc. is a community-driven platform that fosters open conversations on the internet. The Senior Staff Machine Learning Systems Engineer will lead the vision for systems powering the Reddit Home Feed, focusing on backend engineering and systems architecture to enhance user experience and business metrics.
Responsibilities:
- Architect High-Performance Retrieval: Partner with platform teams to design and implement flexible, ultra-low-latency candidate retrieval solutions that balance discovery with computational efficiency
- Optimize the Serving Lifecycle: Design and build "contributor-friendly" serving pipelines that allow backend and product engineers to safely inject logic and features without needing deep ML expertise
- Build Predictive Reliability: Lead the development of sophisticated shadow-testing infrastructure to evaluate distribution shifts and system performance at scale before code ever hits production
- Drive Cross-Stack Integration: Collaborate across the stack—from GraphQL layers to upstream infrastructure—to ensure the feed’s architecture is resilient, maintainable, and extensible
- Performance Engineering: Identify bottlenecks in the feed's ecosystem and implement systemic fixes that improve the user experience
Requirements:
- 10+ years of experience in backend software engineering, with a focus on architecting and scaling high-throughput, low-latency distributed systems for personalization or recommendation engines
- Expert-level understanding of distributed systems, including real-time data pipelines, advanced caching strategies (e.g., Redis, Memcached), and storage layer optimization
- Proven ability to lead technical strategy across organizational boundaries, partnering with Infrastructure and Platform teams to build extensible, 'contributor-friendly' architectures
- Adept at navigating complex system trade-offs, such as balancing architectural resilience and latency with the evolving needs of machine learning models
- Strong communicator and mentor who leads through collaboration and technical excellence, with experience influencing best practices across a large engineering organization