AirflowHadoopPythonRaySparkSQLGoRMachine LearningJupyterMLflowKubeflowAnalyticsGitGitHubBitbucketVersion Control
About this role
Role Overview
Design predictive modeling projects to address specific business problems determined by consultation with business partners
Work with data-sets of varying degrees of size and complexity including both structured and unstructured data
Piping and processing massive data-streams in distributed computing environments such as Hadoop to facilitate analysis
Implements batch and real-time model scoring to drive actions
Develops proprietary algorithms to build customized solutions that go beyond standard industry tools and lead to innovative solutions
Develop sophisticated visualization of analysis output for business users
Provides high-level controllership/evaluation of all output produced to ensure established targets are met
Proactively collaborates with business partners to determine identified population segments and develop actionable plans to enable the identification of patterns related to quality, use, cost and other variables
Requirements
Requires a Bachelor’s degree in Statistics, Computer Science, Mathematics, Machine Learning, Econometrics, Physics, Biostatistics or related Quantitative disciplines
3 or more years experience in predictive analytics
Advanced expertise with software such as Python, R, SAS, SAS Enterprise Miner or equivalent
PhD and experience in the healthcare sector preferred
Experience in Jupyter notebook server and notebooks and MLFlow is highly preferred
General working knowledge of the following tech/platform stack is preferred (Spark , EzPresto , KubeFlow, Airflow , Feast , Ray and Superset)
Experience with version control systems (Git/Bitbucket/Github, etc.) is preferred