Lead the design and evolution of AI-powered backend systems that support conversation guidance, matchmaking, and personalised user experiences within Bee AI
Architect and scale agent-based systems, including prompt management, context orchestration, and evaluation frameworks, to ensure high-quality, reliable AI interactions
Establish best practices for building, testing, monitoring, and iterating on AI agents, creating a repeatable and scalable development lifecycle across the team
Drive experimentation and prototyping of new AI capabilities, translating early concepts into production-ready systems that deliver measurable user value
Partner cross-functionally with Product, Data, and Trust teams to shape AI-driven features that are both innovative and responsible, demonstrating Respect in how technology impacts people
Contribute deep expertise in Python (including PydanticAI) and Google Cloud Platform (GCP) to build robust, scalable, and observable services
Define and implement evaluation strategies for LLM-powered systems, including offline and online metrics, human-in-the-loop validation, and continuous improvement loops
Act as a technical advisor and thought leader, influencing architecture and long-term strategy across teams while embodying an agile mindset and taking ownership of outcomes
Requirements
Deep experience building and scaling backend systems, with strong expertise in Python and modern service-oriented or distributed architectures
Proven track record of designing and deploying AI/ML-powered products, particularly involving LLMs, agents, or recommendation/personalisation systems
Hands-on experience with prompt engineering, context management, and evaluation of AI systems, with a strong understanding of their limitations and trade-offs
Experience working with cloud platforms such as GCP, including deploying, monitoring, and scaling production systems
Ability to operate at a system level, setting technical direction and influencing across teams while remaining hands-on with implementation
Demonstrated ability to collaborate with purpose, take ownership of complex problems, and see solutions through from insight to real-world impact
Strong commitment to responsible and inclusive AI practices, applying human judgment to ensure fairness, safety, and quality in outputs
Typically requires 10+ years of experience, though we welcome candidates with alternative backgrounds that demonstrate equivalent skills.
High level of AI fluency
you actively leverage AI as a development partner, contribute to evolving best practices, and help others adopt AI effectively and responsibly
Tech Stack
Cloud
Google Cloud Platform
Python
Benefits
Medical/dental/vision, 30-day eligibility
Unlimited PTO + 1 company-wide week off + Focus Fridays every week
Fully paid life and long-term disability insurance
401k with 4% company match if you contribute 6%, 90-day eligibility
Monthly wellness benefit and access to Noom, Unmind, and Your Money Line
Maternity and Fertility benefit + 26 week paid parental leave