Veeam is the Data and AI Trust Company, helping organizations ensure their data and AI are fully understood, secured, and resilient. This role involves operating cloud infrastructure with a strong focus on Microsoft Azure and AI/ML-related solutions, and building a reliable, automated platform that supports AI services and workloads.
Responsibilities:
- Design, deploy, and optimize infrastructure on the Microsoft Azure cloud platform
- Contribute to the design and operation of AI-related components in Azure (depending on the team’s scope), such as Azure OpenAI, Azure AI Search, Azure Machine Learning, AKS for inference/serving, and event-driven integrations
- Apply expertise in AWS and, ideally, GCP to manage infrastructure across cloud platforms
- Implement Infrastructure as Code (IaC) methodologies
- Develop automation scripts that make cloud processes more efficient
- Collaborate with cross-functional teams in requirements analysis, solution development, and implementation to ensure high system performance and reliability
Requirements:
- Experience in managing cloud infrastructure, with a focus on Microsoft Azure (AWS and GCP experience is a plus)
- Experience with Azure AI services (Azure OpenAI, Azure AI Search, Azure Machine Learning) and with operating AI workloads in production, including rate limits, data security, and observability
- Experience deploying secure applications in Azure using services such as Azure Container Registry, Azure Container Apps, and Application Gateway
- Proficiency in Infrastructure as Code (IaC) practices, using tools such as Terraform
- Ability to develop and maintain Python scripts
- Strong analytical and problem-solving skills to troubleshoot issues and optimize cloud infrastructure
- Understanding of DevOps principles, with experience in continuous integration and deployment
- Effective communication skills and a collaborative approach to teamwork