Lead the design, deployment, and maintenance of secure and scalable AI infrastructure to support the execution of AI workloads across on-prem and cloud environments.
Architect and manage compute platforms, storage systems, and containerized environments optimized for AI inference and data processing while ensuring system reliability, performance, and observability.
Collaborate with cross-functional teams to integrate AI models, AI services, and runtime environments into operational pipelines.
Provide subject matter expertise on AI infrastructure and AI integration to the government customer; and establish/maintain a high level of customer trust and confidence with your knowledge and skills through your creativity to provide innovative solutions that fit the customer’s needs.
Requirements
Active Top Secret with SCI eligibility security clearance.
BA/BS degree in IT related field (e.g., IT, Cybersecurity, Computer Science, Information Systems, Data Science, or Software Engineering); 8140 certification in in lieu of degree.
Experience in deploying and managing AI/ML workloads (e.g., Ray, Dask, Kubeflow, KServe, MLflow) with Kubernetes.
5 years of experience in designing, deploying, operating, and maintaining AI-related infrastructure
5 years of experience in designing, deploying, operating, and maintaining containerized or virtualized infrastructure.
Demonstrated experience supporting projects of similar size, scope, and complexity.