This role is responsible for data collection procedures, including accurate and relevant data for machine learning models, extracting and analyzing data from the primary and secondary database.
Ensures data integrity and compliance by performing data cleansing and data validations.
Performs root-cause analysis and recommends or executes corrective actions by implementing auto detect, auto dragonize and auto correct when data from related system problems occur.
Applies subject matter knowledge to solve complex business issues within established guidelines and recommends appropriate alternatives.
Designs and establishes secure and performant data architectures, enhancements, updates, and programming changes for portions and subsystems of data pipelines, repositories or models for structured/unstructured data.
Analyzes design and determines coding, programming, and integration activities required based on general objectives and knowledge of overall architecture of product or solution.
Writes and executes complete testing plans, protocols, and documentation for assigned portion of data system or component; identifies and debugs, and creates solutions for issues with code and integration into data system architecture.
Leads a project team of other data engineers to develop reliable, cost effective and high-quality solutions for assigned data system, model, or component.
Collaborates and communicates with project team regarding project progress and issue resolution.
Analyzes data inaccuracies, identifies opportunities and supports the development of automated solutions to enhance overall quality of the enterprise data.
Identifies problematic areas and conducts research to determine the best course of action to correct the data; identifies, analyzes and interprets trends and patterns in complex datasets.
Works cross-functionally with different departments to assess, define, and develop report deliverables.
Represents the software data engineering team for all phases of larger and more-complex development projects.
Provides guidance and mentoring to less experienced staff members.
Requirements
Four-year or Graduate Degree in Computer Science, Information Technology, Software Engineering, Statistics/ Mathematics, or any other related discipline or commensurate work experience or demonstrated competence.
Typically has 4-7 years of work experience, preferably in data analytics, data engineering, data modeling, or a related field or an advanced degree with 3-5 years of work experience.
Programming Language/s Certification (SQL, Python, or similar) and experience in implementing AI solutions to solve complex data quality problems