Agiloft is a trusted global leader in data-first contract lifecycle management (CLM) software. They are seeking an experienced Staff Software Engineer – Cloud Services to own and evolve the architecture of their cloud-native CLM platform and supporting infrastructure, focusing on design, reliability, scalability, security, and long-term health of cloud services.
Responsibilities:
- Own the architecture of one or more critical cloud service domains, with responsibility for design integrity, scalability, reliability, security, and technical debt management over time
- Act as the architectural steward for AWS-based services, ensuring systems remain scalable, resilient, cost-efficient, secure, and maintainable as customer and platform demands grow
- Define service boundaries, APIs, data contracts, and integration patterns that enable independent evolution of cloud services
- Drive cloud-native design principles, including stateless services, managed services adoption, and failure-tolerant architectures
- Lead the planning and execution of cloud architecture improvements that enhance scalability, availability, performance, and operational efficiency
- Translate architectural vision into mid- and long-term technical roadmaps, balancing near-term delivery with long-term cloud sustainability and cost optimization
- Drive cross-team alignment on design standards and architectural trade-offs
- Partner with CloudOps teams to define reliability targets, disaster recovery strategies, and operational readiness standards
- Champion modern cloud-native development practices, including infrastructure-as-code, automated testing, continuous delivery, and observability-by-default
- Lead adoption of AI-assisted software engineering for cloud service development, infrastructure definition, testing, troubleshooting, and refactoring
- Design cloud services and infrastructure that are AI-friendly by design, enabling safer automation, faster iteration, and improved operational insight
- Collaborate with engineering leadership to establish best practices and guardrails for responsible, secure use of AI tools across the cloud SDLC
- Set and uphold high standards for service reliability, security, observability, performance, and cost management
- Drive improvements to developer experience for cloud services, including local development, CI/CD pipelines, deployment workflows, and operational tooling
- Ensure architectural decisions, service contracts, and operational runbooks are well-documented, discoverable, and actionable
- Promote a strong culture of ownership and continuous improvement
- Mentor senior and mid-level engineers, raising the bar for cloud architecture, distributed systems design, and operational thinking
- Serve as a technical multiplier—unblocking teams, accelerating architectural decisions, and reducing long-term operational complexity
- Communicate complex cloud and systems concepts clearly to both technical and non-technical stakeholders
- Stay ahead of industry trends in AWS services, cloud-native architectures, AI-enabled operations, and DevOps practices, driving pragmatic adoption where it delivers clear business value
Requirements:
- Bachelor's and/or master's degree in computer science, Information Systems, Software, Electrical, or Electronics Engineering, or a comparable field of study
- 10+ years of professional software engineering experience
- 2+ years at staff level
- Strong proficiency in Python, with working experience in Java, building backend- and cloud-based services
- Deep experience designing, building, and operating scalable enterprise-grade applications on AWS
- Demonstrated experience owning and evolving architecture for complex, distributed, and long-lived cloud platforms
- Proven ability to lead service refactoring, software architecture modernization, and platform evolution initiatives while maintaining delivery velocity
- Strong experience with infrastructure-as-code and automation tools such as Terraform or CloudFormation
- Experience designing and evolving CI/CD pipelines for cloud services
- Strong understanding of observability, including logging, metrics, tracing, and alerting in distributed systems
- Proficiency with AI-assisted development processes and tools
- Ability to reason holistically about distributed systems while diving deep into critical technical details when needed
- Exceptional problem-solving skills with a track record of making sound architectural and operational trade-offs
- Demonstrated ability to influence technical direction across teams through expertise, communication, and trust
- Excellent written and verbal communication skills, particularly around architecture, risk, reliability, and long-term technical strategy