Design, build, and test end-to-end data pipelines, including data ingestion (streaming, events, and batch), on cloud-based infrastructure using Azure and Cosmos DB
Develop and implement ETL processes for ingesting, transforming, and loading data into data lakes
Work extensively with Databricks and data warehousing concepts
Design and develop custom, high-throughput, configurable frameworks and libraries
Drive change through collaboration, influence, and demonstration of proofs of concept (POCs)
Take responsibility for all aspects of the software development lifecycle, including design, coding, integration testing, deployment, and documentation
Work collaboratively within an agile project team
Follow best practices and coding standards
Grow your personal skillset
Requirements
Bachelor’s degree-level qualification in a computer- or IT-related subject
10+ years of overall big data pipeline experience
5+ years of hands-on Databricks experience
5+ years of experience in cloud-based development, including Azure services, Azure DevOps, Kubernetes, and Docker
Experience with Snowflake is a plus
Experience with microservices architecture and an understanding of cloud computing are highly desirable (Azure preferred)
Proficient working knowledge of data warehousing platforms such as Databricks, Delta Lake, and Apache Spark
Hands-on development experience in Databricks SQL and Scala
Strong understanding of the Databricks platform, including clusters, jobs, and other resources
Ability to monitor and troubleshoot data pipelines to identify and resolve issues
Ability to implement data quality checks and validations to ensure data accuracy
Experience performing data analysis and data exploration
Strong hands-on experience troubleshooting DevOps pipelines (Azure DevOps and Harness)
Experience working in a multi-developer environment, using version control (e.g., Git)
Strong critical thinking, communication, and problem-solving skills
Experience handling semi-structured data (Avro, JSON, and XML)
Basic knowledge of shell scripting, UNIX, TOAD, and SQL Developer
Tech Stack
Apache
Azure
Cloud
Docker
ETL
Kubernetes
Scala
Shell Scripting
Spark
SQL
Unix
Benefits
generous medical care
insurance coverage including basic life, medical, dental, vision, long-term disability, and other optional additional coverages
retirement savings plan (401(k)) with company match
paid time off, including vacation, sick leave, short-term disability, and family care responsibilities
access to our Employee Assistance Program
incentive compensation including eligibility for annual performance-based awards (excluding certain sales roles subject to sales incentive plans)
eligibility for certain tax-advantaged savings plans