Context66 is a Data, AI, and Enterprise Architecture services company focused on engineering the context that powers AI. They are seeking an exceptional Senior Data Architect / Lead Data Engineer who will design and implement modern enterprise data platforms and work closely with various teams to deliver production-grade solutions.

Responsibilities:

Design end-to-end enterprise data architectures across structured, unstructured, streaming, batch, and event-driven data
Define current-state, transition-state, and target-state architectures aligned with business outcomes
Design and implement modern data platforms, including data warehouses, data lakes, lakehouses, data mesh, data fabric, and federated data architectures
Build scalable data ingestion, integration, transformation, orchestration, and activation pipelines
Define integration patterns across ETL, ELT, APIs, CDC, messaging, streaming, micro-batch, virtualization, and zero-copy architectures
Design reusable, governed data products with clear ownership, contracts, SLAs, quality expectations, and lifecycle controls
Develop canonical, conceptual, logical, physical, and semantic data models
Design enterprise semantic layers, metric definitions, business entities, ontologies, and reusable context models
Build and integrate knowledge graphs, graph databases, GraphRAG solutions, vector stores, and retrieval architectures
Establish patterns for metadata management, lineage, data quality, observability, governance, security, privacy, and compliance
Design data and feature pipelines that support analytics, machine learning, Generative AI, and intelligent agent use cases
Create reference architectures, technical standards, reusable frameworks, accelerators, and engineering playbooks
Perform architecture reviews, design validation, code reviews, performance optimization, and technical risk assessments
Partner with client stakeholders to lead discovery, translate business requirements, and communicate architecture decisions clearly
Mentor engineers and architects while raising the technical quality and delivery maturity of the team
Contribute directly to prototypes, pilots, production implementations, and complex troubleshooting when needed

Requirements:

10 to 15+ years of experience across data architecture, data engineering, integration, analytics, and enterprise platforms
Strong combination of strategic data architecture and hands-on engineering expertise
Demonstrated experience designing and delivering large-scale, production-grade enterprise data solutions
Deep understanding of the complete data lifecycle, from creation and ingestion through processing, consumption, retention, archival, and deletion
Strong expertise in data modeling, including conceptual, logical, physical, canonical, and semantic models
Extensive experience with ETL, ELT, CDC, APIs, event-driven integration, streaming, orchestration, and data activation
Hands-on experience with modern cloud data platforms such as Snowflake, Databricks, AWS, Azure, or Google Cloud
Strong knowledge of data warehousing, lakehouse architectures, data lakes, data mesh, data fabric, virtualization, and federated query patterns
Practical experience with data governance, metadata, lineage, quality, observability, master data, access controls, and regulatory requirements
Deep understanding of reusable data products, data contracts, domain ownership, SLAs, and federated governance
Strong knowledge of analytics, BI, self-service data consumption, feature stores, and AI-ready data foundations
Excellent problem-solving, communication, documentation, and stakeholder-management skills
Ability to explain complex technical concepts clearly to both executives and engineering teams
Large language models, Generative AI, intelligent agents, and enterprise AI architectures
RAG, GraphRAG, knowledge graphs, graph databases, and vector databases
Ontologies, semantic modeling, enterprise semantic layers, and metric frameworks
Metadata-driven automation, active metadata, policy enforcement, and context engineering
AI evaluation, observability, grounding, hallucination reduction, security, and responsible AI controls
Integration of enterprise data platforms with intelligent applications and agent workflows
AI-enabled data quality, metadata enrichment, lineage discovery, and engineering automation
Claude Code, Cursor, GitHub Copilot, Windsurf, OpenAI, Gemini, or equivalent AI-assisted development tools
Agentic development frameworks and AI orchestration platforms
Model Context Protocol architectures and tool-enabled agent patterns
AI-assisted software engineering, testing, documentation, data modeling, and code generation
Autonomous agents, workflow automation, prompt engineering, and structured output patterns
Using AI to accelerate delivery while maintaining human validation, security, governance, and engineering rigor
Comfortable operating in an early-stage, high-growth environment
Able to work independently, move quickly, and deliver with limited supervision
Willing to move between architecture, engineering, client discussions, and delivery execution
Strong sense of ownership, accountability, urgency, and attention to quality
Customer-first mindset with a focus on measurable outcomes
Comfortable challenging assumptions and proposing better technical approaches
Passionate about learning, experimentation, innovation, and emerging technologies
Able to mentor others while remaining hands-on when the situation requires it
Interested in helping build a company, not only completing assigned project tasks
Bachelor's or Master's degree in Computer Science, Information Technology, Engineering, Data Science, or a related discipline
Relevant cloud, data platform, architecture, graph technology, or AI certifications are beneficial

Senior Data Architect / Data Engineer AI/ML

Key skills

About this role

Responsibilities:

Requirements: