Luma is on a mission to build multimodal AI that expands human imagination, creating an AI agent designed to amplify human creativity. The role involves architecting a novel visual reasoning system, rapidly prototyping agentic workflows, and building scalable systems that enhance creative interactions for millions of users.
Responsibilities:
- Invent from First Principles: Architect the core of our agentic stack, moving beyond standard patterns to create a truly novel visual reasoning system
- Ship Frontier Prototypes: Rapidly prototype and ship breakthrough agentic workflows that explore the absolute edge of creative AI, pushing past the limitations of today's VLMs and RAG systems
- Build for Scale & Magic: Own the systems that make these experiences possible, ensuring they are low-latency, high-fidelity, and reliable for millions of creators
- Bridge Research to Product: Partner directly with our world-class research and design teams to transform bleeding-edge models into magical, intuitive product experiences
Requirements:
- 5+ years of professional engineering experience OR an equivalent portfolio of advanced, relevant projects
- Demonstrated ability to build experiences that cross modalities, with VLMs, diffusion models, or other multimodal systems (speech, video, etc.)
- Strong proficiency in Python and a solid understanding of backend systems and databases
- Deep experience with AI/LLM frameworks with a track record of pushing beyond their standard functionality
- Experience in Rust or Go
- Experience with fine-tuning, reinforcement learning, etc
- Familiarity with Kubernetes, Docker, or modern cloud infrastructure
- Background in multimodal systems, especially creative tools involving video or images
- Highly Encouraged: Portfolio links, open-source contributions, or notable hackathon projects