Luma is on a mission to build multimodal AI that expands human imagination, creating an AI agent designed to amplify human creativity. The role involves architecting a novel visual reasoning system, rapidly prototyping agentic workflows, and building scalable systems that enhance creative interactions for millions of users.

Responsibilities:

Invent from First Principles: Architect the core of our agentic stack, moving beyond standard patterns to create a truly novel visual reasoning system
Ship Frontier Prototypes: Rapidly prototype and ship breakthrough agentic workflows that explore the absolute edge of creative AI, pushing past the limitations of today's VLMs and RAG systems
Build for Scale & Magic: Own the systems that make these experiences possible, ensuring they are low-latency, high-fidelity, and reliable for millions of creators
Bridge Research to Product: Partner directly with our world-class research and design teams to transform bleeding-edge models into magical, intuitive product experiences

Requirements:

5+ years of professional engineering experience OR an equivalent portfolio of advanced, relevant projects
Demonstrated ability to build experiences that cross modalities, with VLMs, diffusion models, or other multimodal systems (speech, video, etc.)
Strong proficiency in Python and a solid understanding of backend systems and databases
Deep experience with AI/LLM frameworks with a track record of pushing beyond their standard functionality
Experience in Rust or Go
Experience with fine-tuning, reinforcement learning, etc
Familiarity with Kubernetes, Docker, or modern cloud infrastructure
Background in multimodal systems, especially creative tools involving video or images
Highly Encouraged: Portfolio links, open-source contributions, or notable hackathon projects

Software Engineer - AI Agents

Key skills

About this role

Responsibilities:

Requirements: