Salient is building AI infrastructure for financial operations, automating complex workflows in loan servicing. The Staff Infrastructure Engineer will architect and manage the systems that support the company's growth, focusing on reliability, scalability, and developer velocity.
Responsibilities:
- Lead architectural decisions and technical reviews for infrastructure-critical initiatives
- Design, build, and own the cloud infrastructure (AWS/GCP) that runs Salient - from compute and networking to storage and observability
- Develop scalable harnesses that enable coding agents to operate reliably without compromising system stability or code quality
- Partner closely with the modeling team to optimize the serving and performance of GPU-intensive workloads
- Drive reliability and performance across the stack by defining SLOs, building robust monitoring and alerting, and leading incident response and postmortems
- Own developer platform investments that materially improve engineering velocity, including CI/CD, deployment tooling, environments, and internal infrastructure abstractions
- Establish infrastructure best practices, patterns, and standards as a technical authority across the engineering org
- Identify and reduce technical debt across infrastructure systems, with a focus on long-term scalability and operational health