Akkodis is a staffing company with a strong presence across North America, dedicated to connecting clients with top talent. They are seeking a Senior Machine Learning Engineer to design, build, and deploy Large Language Model (LLM) solutions in a production environment, focusing on high-throughput, low-latency AI systems.
Responsibilities:
- Design, build, and deploy LLM‑powered systems handling large‑scale, real‑world workloads
- Write production‑quality Python code from scratch for data pipelines, model training, inference, and evaluation
- Build and maintain end‑to‑end ML pipelines (ingestion → training → validation → deployment)
- Fine‑tune and adapt LLMs using: Full fine‑tuning, LoRA, Prompt engineering, DPO / preference‑based optimization
- Optimize models for latency, throughput, memory, and cost Quantization, compression, distillation
- Design and implement Retrieval‑Augmented Generation (RAG) systems: Vector databases, Embeddings, Semantic search and document chunking
- Build evaluation frameworks, feedback loops, and continuous improvement pipelines
- Implement MLOps best practices: CI/CD for ML, Monitoring, logging, alerting
- Highly available production systems
Requirements:
- 5+ years of hands-on ML engineering experience in production environments
- Expert-level Python — ability to design and implement systems without AI‑generated code
- Proven experience deploying LLMs and NLP systems at scale
- Strong background in: LLM fine‑tuning and adaptation techniques
- Model optimization (quantization, compression, distillation)
- RAG architectures and vector search
- Experience with modern ML/LLM frameworks (e.g., LangChain, LlamaIndex)
- Solid MLOps experience (CI/CD, automated evaluation, monitoring)
- Cloud ML experience (e.g., GCP, Databricks, or similar)