Chainguard is the secure foundation for software development and deployment. They are seeking a Staff Software Engineer to lead the architecture and implementation of their Libraries Platform, focusing on building secure and reliable open-source libraries while automating workflows and enhancing developer productivity.
Responsibilities:
- Own the architecture and technical direction for the Libraries Platform: the services, pipelines, and package index that power secure, reproducible build, test, and distribution workflows for libraries across multiple ecosystems (Java, JavaScript, Python/AI/ML)
- Design and maintain automation for artifact creation, updates, and verification, including vulnerability scanning, remediation workflows, SBOM and provenance generation, and policy enforcement across our library catalog
- Build and operate shared platform services such as package indexes, registry mirrors, metadata services, and orchestration tooling that serve both external customers and internal ecosystem teams
- Develop internal developer tools and CLIs (often in Go) that improve how we build, test, and ship libraries at scale, including integration with build systems and CI/CD for multiple ecosystems
- Drive reliability, scalability, and observability for the Libraries platform: define SLOs, build monitoring and alerting, and lead incident response and post‑incident improvements
- Solve complex dependency and build issues in production environments, from toolchain and compiler problems to CI/CD flakiness and registry/package index edge cases
- Partner closely with ecosystem teams (Java, JavaScript, Python/AI/ML), Platform, Delivery, Sustaining, and Security to ensure the platform meets reliability, security, and product requirements
- Mentor and unblock other engineers through design reviews, documentation, and hands‑on debugging, helping to “code culture” into how we build and run our libraries platform
Requirements:
- 8+ years designing, building, and operating infrastructure for language ecosystems or developer platforms, such as build systems, package registries, or CI/CD for widely used libraries or services
- Strong proficiency in Go (Golang) or strong readiness to ramp quickly
- Proven track record building and owning developer tooling and automation (plugins, CLIs, code generators, or custom pipelines) that improve how engineers build and ship software at scale
- Strong background in CI/CD, cloud‑native infrastructure, and IaC: containers (Docker/OCI, Kubernetes), public cloud (GCP, AWS, Azure), and tools like Terraform and GitHub Actions/Argo/Tekton (or equivalents)
- Demonstrated ability to debug and resolve complex toolchain, compiler, packaging, and infrastructure failures in production, and to drive those issues to root cause and lasting fixes
- Comfortable working across SRE / platform / DevOps style responsibilities, including reliability, observability, and performance tuning for critical services and pipelines
- Excellent communication in a remote, distributed environment, with a bias toward documentation, clarity, and collaboration across product, infra, and security teams
- A staff‑level ownership mindset: you set technical direction, own critical outcomes, and are comfortable in an early, high‑impact area where engineers help shape both the roadmap and the culture
- Open source contributions in ecosystem tooling, libraries, or packaging (Java, JavaScript, Python/ML, or related infra)
- Experience with software supply chain security: SLSA, SBOMs, sigstore, provenance, attestations, or secure‑by‑default packaging practices
- Background with Linux distributions, packaging, and reproducible build systems (e.g., Alpine, Wolfi, Debian Bazel, CMake, Ninja)
- Familiarity with AI/ML packaging and infrastructure building native Python libraries and ML frameworks (e.g., PyTorch, TensorFlow) and deploying them in cloud/Kubernetes environments
- Prior experience in SRE, platform engineering, or DevOps roles where you owned infrastructure for developer productivity, CI/CD, or large language‑ecosystem codebases