Nscale is a GPU cloud provider engineered for AI, offering high-performance infrastructure to AI-focused companies. They are seeking a Data Engineer to design and operate the data foundations that support their platform and internal operations, transforming raw operational signals into reliable data products.

Responsibilities:

Design and build scalable, reliable data pipelines that ingest data from infrastructure, platform services, and business systems
Define data models and schemas that support operational workflows and use cases across the business, monitoring, and analytics
Clean, transform and structure the data to create a digital twin of Nscale
Implement permissioning and manage access and security of the Foundry implementation
Create trusted datasets and metrics that power workflows and processes, internal tools, and customer-facing insights
Enable self-serve analytics by establishing clear data contracts, documentation, and semantic layers
Build use cases including but not limited to capacity planning, cost optimisation, reliability analysis, and customer reporting to drive our business forward
Collaborate with Product and Commercial teams to translate real-world questions into robust data solutions
Implement data quality checks, monitoring, and alerting to ensure data correctness and availability
Codify data lineage, freshness, and consistency across systems
Establish best practices around data versioning, access control, and governance appropriate for a fast-scaling company
Continuously improve system resilience and observability
Take end-to-end ownership of projects, from design through to production and iteration
Help define standards, tooling, and ways of working for data at Nscale
Contribute to technical decision-making as the company scales its platform and customer base
Act as a thought partner to engineers and operators, not just a service function

Requirements:

Deep, hands-on experience building in Palantir Foundry, including ontology modelling, pipeline development, API integration, and large-scale data platform design
Strong proficiency in Python, with experience applying data engineering libraries and frameworks (e.g. Spark, PySpark, Dask, pandas) to work with large, complex datasets
Familiarity with API-driven data integration, including REST, GraphQL, and Foundry Action APIs
Practical experience working in Git-based development workflows, including code reviews, version control, and CI/CD pipelines
Comfort working in ambiguous, early-stage environments where requirements evolve quickly
Strong communication skills — able to explain data concepts clearly to technical and non-technical stakeholders
A bias toward ownership, pragmatism, and shipping useful solutions
Experience with cloud platforms (AWS, GCP, Azure) and infrastructure telemetry
Familiarity with distributed systems, monitoring data, or usage-based billing data
Experience supporting customer-facing data products or platforms

Data Engineer

Key skills

About this role

Responsibilities:

Requirements: