DataRobot is a company that delivers AI solutions to maximize impact and minimize business risk. They are seeking a DevOps Engineer II to work collaboratively with engineers to architect efficient, reliable, and scalable software systems, focusing on Kubernetes and cloud computing.
Responsibilities:
- Rightsize workloads for efficient resource utilization in Kubernetes and cloud service
- Contribute to the development, deployment and operations for new microservices within the platform
- Review technical specifications to provide guidance and help development teams drive operational excellence
- Work hand-in-hand with software developers to facilitate the development and adoption of 'Paved Road' solutions and DevSecOps processes
- Support large-scale services across multiple environments
- Assist in resolution efforts for problems ranging from infrastructure network layers to application scaling
- This role includes participation in an on-call rotation - we believe in shared ownership of our platform and aim to build systems that are resilient, observable, and require minimal intervention
Requirements:
- 4+ years of experience in DevOps, systems engineering, or a related role
- Strong experience with Kubernetes
- Strong experience with Helm
- Strong experience with Python
- Strong experience with Terraform
- Strong experience with Linux
- Strong experience with Git and Github
- Strong experience working with at least one major cloud platform (AWS, Azure, or GCP)
- Fundamental understanding of Kubernetes and Helm
- Experience in building and running software systems on Kubernetes clusters in production
- Hands-on experience with infrastructure provisioning and configuration using Infrastructure as Code (IaC) principles
- An understanding of design for scalability, performance, efficiency and reliability
- Self-motivated and proactive, able to take ownership and deliver results
- Ability and willingness to learn about new technologies
- Effective communication with technical and non-technical stakeholders
- CKAD (Certified Kubernetes Application Developer) certification
- Experience with Rightsizing workloads
- Cloud cost dashboards
- Harness CI/CD
- Artifactory
- MongoDB
- RabbitMQ
- Postgres
- Redis
- Knowledge of TCP/IP networking, SSL, DNS, Load Balancers
- CI/CD pipeline experience
- Real-world experience decoupling monolithic software into smaller reusable components
- Handle high-pressure situations in a calm and professional manner