Mercury Insurance is a recognized leader in the insurance industry, recently awarded as one of America's Best Midsize Employers for 2025. They are seeking an AI Engineer to join their AI/ML Engineering team, where the role involves implementing advanced AI solutions, including Large Language Models and multimodal applications, while collaborating with the engineering team to enhance software systems.
Responsibilities:
- Implement and fine-tune Large Language Models (LLMs) for various business applications
- Develop Vision Language Models (VLMs) for image understanding and multimodal applications
- Design and build document understanding systems leveraging OCR and advanced text processing
- Create speech recognition and conversational AI solutions
- Recommend and establish human-in-the-loop processes for AI model improvement across modalities
- Build and deploy AI agents and assistants using foundation models
- Integrate multimodal capabilities (text, vision, speech) into unified AI systems
- Evaluate and benchmark foundation models across different domains and tasks
- Implement effective prompt engineering strategies for LLMs and VLMs
- Partner with our platform team for model serving infrastructure and monitoring solutions for production AI systems
- Create and maintain evaluation frameworks and metrics for different AI modalities
- Collaborate with team members to integrate AI capabilities into existing software systems
- Stay current with emerging AI technologies and research advancements
Requirements:
- Bachelor's degree in Computer Science, Software Engineering, or related technical field
- 2+ years of software engineering experience
- 2+ years of hands-on experience with AI/ML systems
- Experience with Large Language Models and/or Vision Language Models
- Familiarity with at least one of: document understanding, OCR, speech recognition, or conversational AI
- Experience with foundation model evaluation, selection, or fine-tuning
- Knowledge of distributed systems concepts
- Strong programming skills in Python and at least one other modern language (e.g., Java, Go)
- Solid understanding of software design patterns and principles
- Experience with AI/ML frameworks (PyTorch, TensorFlow, or similar)
- Proficiency with foundation model APIs (OpenAI, Anthropic, etc.)
- Knowledge of LLM architectures and capabilities
- Knowledge and experience with fine-tuning foundational models
- Experience with at least one: computer vision frameworks, OCR technologies, speech recognition systems, or conversational AI platforms
- Experience with prompt engineering and optimization for foundation models
- Knowledge of AI evaluation methodologies and metrics
- Ability to work with cloud-based AI infrastructure
- Good verbal and written communication abilities
- Master's degree or PhD in Computer Science, Software Engineering, or related technical field
- 3+ years of software engineering experience
- 3+ years of hands-on experience implementing AI/ML solutions
- Demonstrated experience across multiple AI domains (language, vision, speech)
- Experience developing document understanding or OCR systems
- Track record of building conversational AI or speech recognition applications
- Experience with model deployment and monitoring in production
- Experience with multimodal AI systems integration
- Familiarity with large language model evaluation benchmarks
- Experience with multimodal AI systems (text, vision, speech)
- Expertise in document understanding or information extraction
- Skills in building conversational agents or voice assistants
- Experience with computer vision models and image processing
- Knowledge of RAG (Retrieval Augmented Generation) systems
- Experience with human-in-the-loop AI systems
- Familiarity with AI alignment and safety practices
- Experience with model deployment and serving infrastructure
- Skills in AI/ML performance optimization techniques
- Experience with containerization and orchestration technologies