CVS Health is a company focused on creating a more connected and compassionate health experience. They are seeking a Principal Data Engineer to build, develop, and maintain cloud-native software applications and systems to support business needs, including data structures, ETL processes, and collaboration with the Data Science team.
Responsibilities:
- Assist with cloud-native application development, web development, API integration and scaling them in real time to meet internal users and end-customer customer needs
- Participate in entire software lifecycle development, testing, CI/CD and production operations
- Develop large scale data structures and pipelines to organize, collect and standardize data to generate insights and addresses reporting needs
- Write ETL (Extract/Transform/Load) processes, design database systems, and develop tools for real-time and offline analytic processing
- Collaborate with Data Science team to transform data and integrate algorithms and models into automated processes
- Leverage knowledge of Hadoop architecture, HDFS commands, and designing and optimizing queries to build data pipelines
- Utilize programming skills in Python, Java, or similar languages to build robust data pipelines and dynamic systems
- Build data marts and data models to support Data Science and other internal customers
- Integrate data from a variety of sources and ensure adherence to data quality and accessibility standards
- Analyze current information technology environments to identify and assess critical capabilities and recommend solutions
- Experiment with available tools and advise on new tools to provide optimal solutions that meet the requirements dictated by the model/use case
- Mentor junior Cloud Engineers
Requirements:
- Bachelor's degree (or foreign equivalent) in Computer Science, Computer Engineering, Information Technology, Engineering, or a related field
- five (5) years of progressive, postbaccalaureate experience in the job offered or related occupation
- five (5) years of experience in Software development lifecycle (SDLC)
- five (5) years of experience in CI/CD, Jenkins, GIT, or DevOps
- five (5) years of experience in Java, Python, or Node.js
- five (5) years of experience in XML, JSON, HTML, CSS, or JavaScript
- five (5) years of experience in Agile methodologies or SAFe Software Development Principles
- five (5) years of experience in Angular, JavaScript, React, jQuery, Ajax, Bootstrap, or Backbone
- five (5) years of experience in REST, SOAP, or Web Service APIs
- five (5) years of experience in Docker or Kubernetes
- five (5) years of experience in Hadoop and Hive
- five (5) years of experience in Spark, PySpark, or Scala
- five (5) years of experience in Airflow, Kafka, Hbase, Pig, MySQL, or NoSQL
- five (5) years of experience in Unix, Linux, or Shell scripting
- five (5) years of experience in developing backend services, performing code reviews, and collaborating with peers on software development solutions
- five (5) years of experience in contributing to large-scale applications development, data science, or data analytics projects
- five (5) years of experience in designing data models and solutions for analytical and reporting use cases
- five (5) years of experience in Big Data implementation
- five (5) years of experience in developing data science products
- five (5) years of experience in designing data architectures, including data pipelines, distributed computing engines, and machine learning infrastructure design
- five (5) years of experience in end-to-end data science lifecycle/workflow
- five (5) years of experience in product management in a large, matrixed organization with many internal partners
- five (5) years of experience in working with UI/UX teams to develop wireframes and prototypes based on end user feedback and human centered design
- five (5) years of experience in healthcare data management processes and techniques, including data standards, interoperability, and proper data privacy
- five (5) years of experience in supporting large data, analytics, and technology modernization initiatives