Striveworks helps organizations harness the power of artificial intelligence to solve real-world national security and business challenges. As a Senior DevOps Engineer, you will take ownership of product deployments and ensure seamless integration and optimization of software solutions in various environments.
Responsibilities:
- Automating IaC to manage virtual machines and deploy containers, services, and other infrastructure; leaning on expertise to deploy custom Kubernetes clusters in AWS, Azure, GCP, on-premises, or hybrid cloud environments
- Working with platform developers, other DevOps teammates, and customer-facing teams to define requirements and build solutions for customer use cases of the platform
- Software deployments to commercial and, later, unclassified, CUI, and classified Department of Defense (DOD) networks
- Incident response and initial triage of critical system faults
- Monitoring, automating, and improving software reliability, performance, and availability for various projects
- Acting as a liaison between platform developers and customer-facing teams, taking on operational tasks to ensure the efficient functioning of Striveworks’ solutions
- Providing guidance and leadership to junior DevOps team members
- Contributing to the success of mission-critical systems within national security and commercial clients
- Wearing multiple hats and stepping into vacuums where improvements are needed
- Exploring new technologies and solutions
Requirements:
- 6+ years of direct, hands-on experience in Python and/or Golang programming, or other general purpose programming languages
- Microservice deployment in Kubernetes
- Diagnosing and resolving issues within containerized environments
- Helm Chart and Kustomizations development/deployment
- Automation and IaC (e.g., Terraform, Ansible)
- Cloud infrastructure (e.g., AWS, Azure, GCP, or OpenStack)
- Managing and troubleshooting Linux systems (e.g., RHEL, Ubuntu, CentOS)
- The ability to work cross functionally to define requirements and build solutions for customer use cases of the platform
- The ability to respond professionally and competently to incident reports and triage critical system faults
- Eligibility and willingness to obtain and maintain a Secret (or above) US security clearance
- Due to the nature of this role, candidates must have US citizenship
- Active Secret (or above) US security clearance, and familiarity with DOD networking, tools, infrastructure, security requirements, and policies
- Proficiency with US federal information system security policies, including Security Technical Implementation Guides (STIGs), NIST 800-171, NIST 800-53, CMMC, and ICD 503
- Experience with software deployments to on-premises and cloud-based unclassified, CUI, and classified networks within the DOD
- Experience with DevSecOps/DevOps and CI/CD for the administration and deployment of GPU-enabled servers
- Experience deploying or maintaining Cloud Native Computing Foundation (CNCF) projects
- Experience with network-attached storage (NAS) and storage area network (SAN) technologies
- Experience with Kubernetes and cloud-native applications and services in denied, disrupted, intermittent, and limited impact (DDIL) environments