Clinician Nexus enables health care organizations to build thriving clinician teams with industry-leading technology products and analytics. They are seeking a highly skilled Machine Learning Engineer to develop and deploy machine learning models and advanced data analytics solutions, collaborating with cross-functional teams to drive data-informed decision-making.

Responsibilities:

Design, develop, and deploy ML solutions ranging from traditional ML applications (classification, clustering, recommendations) to LLM-based systems, including document parsing, data extraction, RAG pipelines, and LLM agents
Write clean, maintainable, production-quality Python code that integrates smoothly with existing engineering and deployment infrastructure
Work with large datasets to clean, preprocess, and analyze data, ensuring data quality and integrity
Implement and optimize algorithms using best practices in machine learning, deep learning, and statistical analysis
Collaborate with business stakeholders to understand requirements and deliver data-driven solutions that provide actionable insights
Develop and maintain scalable pipelines and infrastructure for data processing and model training, versioning, deployment, and monitoring
Evaluate the performance of machine learning models, including LLM-specific evaluation approaches, and tune models for optimal performance
Communicate findings, insights, and model performance to both technical and non-technical audiences
Continuously stay updated on the latest trends, technologies, and best practices

Requirements:

Bachelor's or Master's degree in Computer Science, Data Science, Statistics, Mathematics, or a related field. or related experience
Bachelor with 5+ years of relevant experience
Master or higher with 3+ years of relevant experience
Fluent in Python (3+ years of coding experience)
Strong software development practices in Python, including writing maintainable, testable, production-ready code
Solid understanding of LLM architectures and Generative AI
Hands-on experience building and evaluating RAG pipelines
Experience with LLM orchestration frameworks (LangChain, LlamaIndex, or similar)
Proficiency in machine learning libraries such as Scikit-learn and PyTorch; and fundamental libraries such as NumPy and Pandas
Familiarity with cloud platforms (e.g., AWS, GCP, Azure) and containerization tools (e.g., Docker)
Strong understanding of model evaluation metrics across traditional ML (e.g., accuracy, precision, recall, F1) and LLM-based systems (e.g., faithfulness, answer relevancy, hallucination detection), including approaches for evaluating non-deterministic outputs
Experience with model management tools such as MLFlow and the model development life cycle
Experience with version control tools such as Git
Proficiency in adapting SDLC best practices for code development and testing
Excellent problem-solving skills, analytical thinking, and the ability to work in a fast-paced environment
Strong communication skills and the ability to explain complex technical concepts to non-technical stakeholders
Familiarity with optimizing, deploying and scaling automated training pipelines of transformer-based models
Familiarity with distributed training techniques and GPU-accelerated computing
Familiarity with classical NLP approaches
Experience implementing CI/CD pipelines for ML models for automating training, validation, monitoring, and scalable deployment
Experience with integrating and deploying AWS AI/ML services
Experience with Databricks
Experience in Health Care data

Machine Learning Engineer / Data Scientist

Key skills

About this role

Responsibilities:

Requirements: