ZenBusiness is a remote company dedicated to helping entrepreneurs launch and run their businesses. The Staff Software Engineer will contribute to platform-wide initiatives focusing on CI/CD, Kubernetes, and integrating AI tooling into engineering workflows.
Responsibilities:
- Design, build, and scale AI-powered developer tooling:
- Agentic coding and review workflows that use LLMs to analyze code, generate changes, and collaborate via chat
- Policy- and guardrail-aware assistants that can comment, propose code changes, and safely automate repetitive tasks
- Prompt libraries, instructions, and service manuals that make AI adoption effective and safe across teams
- Contribute to the evolution of our CI/CD platform to be faster, more observable, and easier to adopt:
- Build analytics and data pipelines to compare platform performance, baselines, and regressions
- Contribute insights that inform team-level improvements and broader platform decisions
- Strengthen Platform Reliability And Security In Kubernetes
- Improve cluster/app resiliency, rollout strategies, and priority classes for critical services
- Standardize Helm chart patterns, deployment practices, and IaC workflows
- Contribute to incident response playbooks, maintenance modes, and recovery patterns
- Create Paved Paths And Golden Templates For Developers
- Self-service software templates, CLI/automation enhancements, and bootstrap flows
- Clear docs, runbooks, and technical blogs that drive adoption and reduce support load
- Mentor And Grow
- Pair with teammates to land complex changes, champion TDD and code review quality, and uplevel practices within the team
- Anticipate and surface blockers, delays, and tradeoffs before they require escalation – keeping stakeholders informed and expectations aligned
Requirements:
- Staff-level impact building platform capabilities with clear outcomes in developer velocity, reliability, and/or quality
- Deep experience in one or more of the following and working knowledge of the rest: CI/CD systems and reusable pipeline components (e.g., GitHub Actions or similar)
- Kubernetes, Helm, GitOps (e.g., ArgoCD), and infrastructure as code (e.g., Terraform)
- AI for engineering: prompt design, multi-agent orchestration, safety/guardrails, and integrating LLMs into code and review workflows
- Data and analytics for platform decisions (e.g., Python/Node-based ETL, SQL/warehouses, lightweight dashboards)
- Proficiency in at least two languages used for platform work (e.g., Python, TypeScript/Node.js, Bash); comfortable moving across stacks to ship end-to-end solutions
- Strong security and reliability instincts (secrets, least privilege, dependency hygiene, runtime hardening)
- Clear, concise written communication – from crisp docs and service manuals to guides that accelerate adoption across the team
- Able to communicate effectively with a diverse team and external stakeholders; actively listens and ensures shared understanding