Karumi is a fast-growing company focused on delivering personalized product demos using AI technology. They are seeking an AI Engineer to join their team, where the main responsibilities include building and optimizing voice AI systems, designing browser agents, and integrating multimodal AI for real-time user interactions.
Responsibilities:
- Build and optimize voice AI systems using speech-to-text and text-to-speech models
- Design browser agents that navigate, understand, and interact with web applications
- Implement browser automation with computer vision and DOM understanding
- Engineer prompt systems and LLM workflows for consistent, intelligent behavior
- Create evaluation frameworks to measure voice quality, agent accuracy, and user experience
- Integrate multimodal AI - combining voice, vision, and language understanding
- Build real-time AI pipelines where latency and reliability are critical
- Manage the AI Infrastructure and take care of it
- Monitor and improve AI system performance in production environments