Cyber SecurityNumpyPandasPythonScikit-LearnSQLAIMachine LearningMLGenerative AILLMLarge Language ModelsLangChainLlamaIndexAgenticscikit-learnNumPyCollaboration
About this role
Role Overview
Work across Expel using both traditional machine learning and generative AI to solve meaningful problems
Help teams develop the right metrics to measure what matters
Build models that surface threats in our customers’ environments
Design AI-powered tools that make our analysts faster and smarter
Collaborate across operations, engineering, and service delivery to ensure AI and ML capabilities are easy to deploy, monitor, and maintain
Partner with subject matter experts to explore where prompt engineering, knowledge retrieval, and agentic AI can deliver real value alongside classical approaches
Mentor and coach other data scientists and curious non-DS colleagues
Requirements
4+ years of professional data science experience building and shipping models in production
Solid grounding in traditional ML: classification, regression, anomaly detection, clustering, time series — using frameworks like scikit-learn or statsmodels
Hands-on experience with large language models, prompt engineering, and LLM application architectures including knowledge retrieval and grounding (e.g., embeddings-based search, vector stores)
Experience building or contributing to LLM-powered agentic systems, including tool use and orchestration with frameworks like LangChain, LlamaIndex, or similar
Strong Python and SQL skills, utilizing core data libraries (pandas/numpy) and visual toolkits (plotly/seaborn/matplotlib) to transform complex analysis into clear, actionable insights for stakeholders and cross-functional peers
Solid statistical foundations: distributions, hypothesis testing, probability, and the wisdom to know when a simpler model is the right one
Exemplary collaboration skills that result in higher levels of team confidence and morale, streamlining requirements gathering to drive superior output
Ability to communicate technical concepts clearly to non-technical audiences
Passion for finding order and meaning in large, chaotic data sets
Domain experience in cybersecurity (anomaly detection, threat hunting, risk scoring) is a strong plus.