Skysoft Inc. is seeking an HPC Cloud Engineer with expertise in AWS and Terraform. The role involves designing and implementing high-performance computing architectures and managing cloud-based infrastructure using DevOps practices.
Requirements:
- Strong experience with AWS services: EC2, VPC, IAM, S3, FSx, EBS, CloudWatch
- Hands-on experience with AWS ParallelCluster
- Solid understanding of Linux system administration (RHEL)
- Experience with HPC schedulers (Slurm preferred)
- Familiarity with GPU computing (NVIDIA drivers, CUDA, NCCL)
- Scripting skills in Bash and Python
- DevOps and automation (GitHub Actions / AWS CodeBuild)
- Infrastructure as Code using Terraform / CloudFormation
- Experience with monitoring and performance tuning tools
- Experience with AWS Batch and containerized HPC workloads
- Understanding of hybrid HPC (on-prem + AWS)
- AWS Certifications (Solutions Architect, SysOps, or Specialty)
- Experience in HPC CFD & FEA applications