Pandas, Python, Scikit-Learn, Spark, SQL, Tableau, Machine Learning, Large Language Models, XGBoost, Jupyter, Databricks
About this role
Role Overview
Work remotely between 9 a.m. and 5 p.m. PST.
Collaborate with internal stakeholders to understand their goals and objectives, enabling data-driven decision-making across the organization.
Utilize programming skills to explore and analyze data.
Identify patterns, trends, nuances, and potential data quality issues within complex datasets.
Apply supervised and unsupervised machine learning techniques to extract insights from data.
Communicate analysis findings and conclusions effectively to both technical and non-technical stakeholders.
Integrate datasets from diverse business sources into a unified format for analysis.
Develop custom queries to address ad-hoc and ongoing customer analytical requests.
Requirements
Currently pursuing a bachelor’s or graduate degree in a quantitative field (e.g., Data Science, Computer Science, Mathematics), with a graduation date no earlier than September 2026.
Proficient in SQL, Python, and Pandas, with experience using Jupyter Notebooks.
Skilled in creating visualizations with Python libraries (e.g., matplotlib, seaborn, plotly).
Familiar with statistical principles (e.g., hypothesis testing), the machine learning model lifecycle, and ML libraries (e.g., XGBoost, scikit-learn).
Experience programmatically interfacing with large language models (LLMs).