Epiq AI Labs is focused on building AIDA, an AI-driven platform for legal discovery and knowledge management. The Platform Engineer role involves designing and implementing core systems that enhance performance, scalability, and reliability across the platform, while collaborating with various engineering teams.
Responsibilities:
- Design and implement identity and access management solutions using modern authentication standards (OAuth2, OIDC, SAML)
- Design and build secure, scalable Customer Identity and Access Management (CIAM) systems for multi-tenant environments
- Establish authorization patterns and policies that support complex organizational hierarchies and role-based access control
- Design and implement multi-tenant database architectures that support AI workloads at scale
- Optimize relational, vector, and search database deployments for performance and cost efficiency
- Produce technical design documents and lead execution of database infrastructure improvements
- Establish data modeling patterns and best practices across the platform
- Design and implement message-queue-based asynchronous processing frameworks
- Lead migrations from legacy systems to modern, scalable architectures
- Improve system reliability and fault tolerance
- Design and implement scalable systems for workloads comprising millions of small files, including parsing, ingestion, storage, partitioning, and indexing strategies
- Build data pipeline architectures with comprehensive observability to monitor pipeline health, throughput, and data quality
- Optimize storage layouts and access patterns for legal document processing at scale
- Work closely with product, AI, backend, and security teams to align infrastructure capabilities with platform needs
- Establish best practices that improve system maintainability and developer velocity
Requirements:
- 5+ years of experience in platform engineering, infrastructure engineering, or related roles
- Proven experience designing and implementing Customer Identity and Access Management (CIAM) systems
- Familiarity with PostgreSQL, Qdrant, and Solr
- Demonstrated experience building and operating production systems at scale
- Strong track record of system design and architecture, including producing technical design documents
- Proficiency in at least one programming language commonly used for infrastructure (Python, Go, or similar)
- Hands-on experience with infrastructure-as-code tools (Terraform or similar)
- Experience with the Azure cloud platform
- Experience with observability tooling (Prometheus, Grafana, OpenTelemetry, or similar)
- Familiarity with containerized environments (Kubernetes, Docker)
- Experience with AI/ML infrastructure and data-intensive applications