NetDocuments is the world’s #1 trusted cloud-based content management and productivity platform that helps legal professionals do their best work. As a Staff Cloud Engineer, you will manage and optimize AWS infrastructure, ensuring high availability, security, and performance while implementing disaster recovery and automation strategies.
Responsibilities:
- Operate and optimize AWS Virtual Private Clouds (VPCs), Route53, Content Delivery Networks (CDNs), load balancers, and EC2 instances
- Ensure the infrastructure is configured and maintained to meet high availability, security, and performance requirements
- Monitor and troubleshoot infrastructure issues, proactively identifying and resolving potential bottlenecks or vulnerabilities
- Design and implement robust disaster recovery and resiliency automation solutions spanning across AWS Availability Zones
- Conduct regular testing and validation of the disaster recovery plan to ensure its effectiveness and reliability
- Collaborate with cross-functional teams to identify and mitigate risks, ensuring business continuity in the event of a disaster
- Collaborate with the team to provide operational support during incidents, utilizing your deep understanding of AWS infrastructure
- Participate in incident response activities, including troubleshooting, root cause analysis, and implementing corrective actions
- Contribute to the development and improvement of incident response processes and procedures
- Implement and maintain effective monitoring solutions, including log management, metrics, collection, and alerting
- Analyze monitoring data to identify performance trends, capacity needs, and potential areas for optimization
- Use Observability best practices with AWS CloudWatch and DataDog
- Continuously evaluate and optimize the AWS infrastructure to improve performance, scalability, and cost efficiency
- Implement automation and orchestration tools to streamline infrastructure provisioning, configuration, and management processes
- Stay updated with the latest AWS services, features, and best practices, incorporating them into the infrastructure architecture as appropriate
- Maintain accurate and up-to-date documentation of the infrastructure architecture, configurations, and operational procedures
- Collaborate with cross-functional teams, including developers, system administrators, and security teams, to ensure effective communication and alignment of infrastructure requirements
- Develop and maintain infrastructure-as-code (IaC) using tools like Terraform or CloudFormation
- Implement and enhance CI/CD pipelines for efficient and automated application deployments. Experience with Harness, ADO Pipelines, or GitHub Actions
- Drive the adoption of DevOps practices, including configuration management, continuous integration, and automated testing
- Deep knowledge and proficiency in automation scripting best practices using tools like Python, and/or PowerShell
Requirements:
- Bachelor's degree in computer science, Engineering, or a related field, or equivalent practical experience
- 10+ years with hands-on experience in AWS
- AWS Certification required
- Ability to independently deliver features following story descriptions and acceptance criteria
- Proficiency in engaging in technical discussions with senior engineers and architects, requiring minimal supervision for individual tasks
- Proficiency in automation tools such as Terraform, GitHub Actions; and scripting in Python, Bash or PowerShell
- Ability to offer guidance to other Cloud Engineers and work collaboratively within the team
- Advanced AWS Certifications are a plus