CVS Health is a company focused on building a connected and compassionate health experience. They are seeking a Sr. Data Engineer to develop, build, and manage large-scale data structures and pipelines while collaborating with the Data Science team to enhance data analytics capabilities.
Responsibilities:
- Develop large scale data structures and pipelines to organize, collect and standardize data to generate insights and addresses reporting needs
- Write ETL (Extract/Transform/Load) processes, design database systems, and develop tools for real-time and offline analytic processing that improve existing systems and expand capabilities
- Collaborate with Data Science team to transform data and integrate algorithms and models into automated processes
- Test and maintain systems and troubleshoot malfunctions
- Leverage knowledge of Hadoop architecture, HDFS commands, and designing and optimizing queries to build data pipelines
- Utilize programming skills in Python, Java, or similar languages to build robust data pipelines and dynamic systems
- Build data marts and data models to support Data Science and other internal customers
- Integrate data from a variety of sources and ensure adherence to data quality and accessibility standards
- Analyze current information technology environments to identify and assess critical capabilities and recommend solutions to complex business problems
- Experiment with available tools and advise on new tools to provide optimal solutions that meet the requirements dictated by the model/use case
Requirements:
- Master's degree (or foreign equivalent) in Computer Science, Data Science, Statistics, Mathematics, Analytics, or a related field
- one (1) year of experience in the job offered or related occupation
- one (1) year of experience in Software development best practices
- one (1) year of experience in Data analytics on large data sets in healthcare, business, or retail sector
- one (1) year of experience in Java, Python, or R
- one (1) year of experience in Agile methodologies or SAFe Software Development Principles
- one (1) year of experience in Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP)
- one (1) year of experience in SQL or SAS
- one (1) year of experience in Spark, PySpark, or Scala
- one (1) year of experience in Databricks
- one (1) year of experience in Extract/Transform/Load (ETL) processes
- one (1) year of experience in Cloud components including cluster management
- one (1) year of experience in Contributing to large-scale applications development, data science, or data analytics projects
- one (1) year of experience in Designing data architectures, including data pipelines, distributed computing engines, and machine learning infrastructure design
- one (1) year of experience in Healthcare data management processes and techniques, including data standards, interoperability, and proper data privacy