Neovance is transforming the patient experience through innovation and operational excellence in the biopharmaceutical industry. As a Senior Data Engineer, you will design and manage data pipelines for real-time analytics and enterprise reporting, collaborating with both US and India-based teams to enhance patient access solutions.
Responsibilities:
- Design, build, and optimize scalable near real-time data pipelines using Databricks as the core compute and processing engine
- Architect and maintain foundational data models and semantic layers that power enterprise Power BI reporting and Master Data Management (MDM) initiatives
- Lead the extraction and migration of legacy data from on-premise and enterprise systems (Oracle, SQL Server, Siebel) into the Databricks Bronze layer
- Design and implement a comprehensive data quality framework to ensure high-fidelity, reliable data delivery across the platform
- Establish and enforce data governance, access controls, and security best practices using Unity Catalog
- Champion DataOps methodologies and version control standards across the engineering team
- Serve as a technical subject matter expert, consulting directly with internal stakeholders and external clients to gather requirements and deliver customer-facing data solutions
- Translate complex technical architecture into clear, actionable insights for non-technical audiences — including biopharma clients
- Partner daily with our India-based engineering team to coordinate sprint execution, manage handoffs, and maintain alignment across time zones
- Collaborate with US-based Product/Project Managers, Data Analytics, and Data Governance teams to prioritize and deliver against roadmap goals
- Actively participate in sprint planning, ticket management, and documentation using Jira, Confluence, and SharePoint
Requirements:
- 5+ years of dedicated experience in Data Engineering or a closely related field
- Expert-level, hands-on proficiency with Databricks — including pipeline architecture, performance tuning, and near real-time streaming — and Unity Catalog
- Deep expertise in data modeling and architecture designed to power enterprise Power BI deployments
- Advanced SQL, deep experience with SQL Server, and strong programming skills in Python and/or Scala
- Working knowledge of AWS for cloud infrastructure and storage integration
- Proven track record implementing DataOps, version control (Git), and robust data quality frameworks
- Demonstrated ability to work independently, navigate ambiguity, and present technical concepts to client stakeholders across global time zones
- Hands-on experience with Microsoft Azure and/or Google Cloud Platform (GCP)
- Pharmaceutical domain expertise, including familiarity with industry reference data sets such as NPPES, NCPDP, Evaluate Pharma, Medi-Span, and Symphony
- Experience integrating with ERPs (NetSuite, SAP), HR Information Systems, and Salesforce
- Experience with enterprise integration platforms such as Boomi, Pega, or Informatica
- Strong background in Master Data Management (MDM) delivery