Chainguard is the trusted source for open source, delivering hardened, secure builds of open source software. The Staff Software Engineer on the Libraries Platform team will lead the architecture and implementation of a platform that builds, verifies, and serves open-source libraries, ensuring they are secure and up to date.
Responsibilities:
- Own the architecture and technical direction for the Libraries Platform: the services, pipelines, and package index that power secure, reproducible build, test, and distribution workflows for libraries across multiple ecosystems (Java, JavaScript, Python/AI/ML)
- Design and maintain automation for artifact creation, updates, and verification, including vulnerability scanning, remediation workflows, SBOM and provenance generation, and policy enforcement across our library catalog
- Build and operate shared platform services such as package indexes, registry mirrors, metadata services, and orchestration tooling that serve both external customers and internal ecosystem teams
- Develop internal developer tools and CLIs (often in Go) that improve how we build, test, and ship libraries at scale, including integration with build systems and CI/CD for multiple ecosystems
- Drive reliability, scalability, and observability for the Libraries platform: define SLOs, build monitoring and alerting, and lead incident response and post‑incident improvements
- Solve complex dependency and build issues in production environments, from toolchain and compiler problems to CI/CD flakiness and registry/package index edge cases
- Partner closely with ecosystem teams (Java, JavaScript, Python/AI/ML), Platform, Delivery, Sustaining, and Security to ensure the platform meets reliability, security, and product requirements
- Mentor and unblock other engineers through design reviews, documentation, and hands‑on debugging, helping to “code culture” into how we build and run our libraries platform
Requirements:
- 8+ years designing, building, and operating infrastructure for language ecosystems or developer platforms, such as build systems, package registries, or CI/CD for widely used libraries or services
- Strong proficiency in Go (Golang) or strong readiness to ramp quickly
- Proven track record building and owning developer tooling and automation (plugins, CLIs, code generators, or custom pipelines) that improve how engineers build and ship software at scale
- Strong background in CI/CD, cloud‑native infrastructure, and IaC: containers (Docker/OCI, Kubernetes), public cloud (GCP, AWS, Azure), and tools like Terraform and GitHub Actions/Argo/Tekton (or equivalents)
- Demonstrated ability to debug and resolve complex toolchain, compiler, packaging, and infrastructure failures in production, and to drive those issues to root cause and lasting fixes
- Comfortable working across SRE / platform / DevOps style responsibilities, including reliability, observability, and performance tuning for critical services and pipelines
- Excellent communication in a remote, distributed environment, with a bias toward documentation, clarity, and collaboration across product, infra, and security teams
- A staff‑level ownership mindset: you set technical direction, own critical outcomes, and are comfortable in an early, high‑impact area where engineers help shape both the roadmap and the culture
- Open source contributions in ecosystem tooling, libraries, or packaging (Java, JavaScript, Python/ML, or related infra)
- Experience with software supply chain security: SLSA, SBOMs, sigstore, provenance, attestations, or secure‑by‑default packaging practices
- Background with Linux distributions, packaging, and reproducible build systems (e.g., Alpine, Wolfi, Debian Bazel, CMake, Ninja)
- Familiarity with AI/ML packaging and infrastructure building native Python libraries and ML frameworks (e.g., PyTorch, TensorFlow) and deploying them in cloud/Kubernetes environments
- Prior experience in SRE, platform engineering, or DevOps roles where you owned infrastructure for developer productivity, CI/CD, or large language‑ecosystem codebases