Mercury Insurance is a recognized leader in the insurance industry, recently awarded as one of America's Best Midsize Employers for 2025. They are seeking an AI Engineer to join their AI/ML Engineering team, where the role involves implementing advanced AI solutions, including Large Language Models and multimodal applications, while collaborating with the engineering team to enhance software systems.

Responsibilities:

Implement and fine-tune Large Language Models (LLMs) for various business applications
Develop Vision Language Models (VLMs) for image understanding and multimodal applications
Design and build document understanding systems leveraging OCR and advanced text processing
Create speech recognition and conversational AI solutions
Recommend and establish human-in-the-loop processes for AI model improvement across modalities
Build and deploy AI agents and assistants using foundation models
Integrate multimodal capabilities (text, vision, speech) into unified AI systems
Evaluate and benchmark foundation models across different domains and tasks
Implement effective prompt engineering strategies for LLMs and VLMs
Partner with our platform team for model serving infrastructure and monitoring solutions for production AI systems
Create and maintain evaluation frameworks and metrics for different AI modalities
Collaborate with team members to integrate AI capabilities into existing software systems
Stay current with emerging AI technologies and research advancements

Requirements:

Bachelor's degree in Computer Science, Software Engineering, or related technical field
2+ years of software engineering experience
2+ years of hands-on experience with AI/ML systems
Experience with Large Language Models and/or Vision Language Models
Familiarity with at least one of: document understanding, OCR, speech recognition, or conversational AI
Experience with foundation model evaluation, selection, or fine-tuning
Knowledge of distributed systems concepts
Strong programming skills in Python and at least one other modern language (e.g., Java, Go)
Solid understanding of software design patterns and principles
Experience with AI/ML frameworks (PyTorch, TensorFlow, or similar)
Proficiency with foundation model APIs (OpenAI, Anthropic, etc.)
Knowledge of LLM architectures and capabilities
Knowledge and experience with fine-tuning foundational models
Experience with at least one: computer vision frameworks, OCR technologies, speech recognition systems, or conversational AI platforms
Experience with prompt engineering and optimization for foundation models
Knowledge of AI evaluation methodologies and metrics
Ability to work with cloud-based AI infrastructure
Good verbal and written communication abilities
Master's degree or PhD in Computer Science, Software Engineering, or related technical field
3+ years of software engineering experience
3+ years of hands-on experience implementing AI/ML solutions
Demonstrated experience across multiple AI domains (language, vision, speech)
Experience developing document understanding or OCR systems
Track record of building conversational AI or speech recognition applications
Experience with model deployment and monitoring in production
Experience with multimodal AI systems integration
Familiarity with large language model evaluation benchmarks
Experience with multimodal AI systems (text, vision, speech)
Expertise in document understanding or information extraction
Skills in building conversational agents or voice assistants
Experience with computer vision models and image processing
Knowledge of RAG (Retrieval Augmented Generation) systems
Experience with human-in-the-loop AI systems
Familiarity with AI alignment and safety practices
Experience with model deployment and serving infrastructure
Skills in AI/ML performance optimization techniques
Experience with containerization and orchestration technologies

AI Engineer

Key skills

About this role

Responsibilities:

Requirements: