Lead DevOps Engineer at Paramount | JobVerse
Paramount
Remote
Lead DevOps Engineer
New York, United States of America
Full Time
2 weeks ago
$139,000 - $190,000 USD
Visa Sponsor
Key skills
AWS
Azure
Cloud
Google Cloud Platform
Jenkins
Kafka
Kubernetes
Prometheus
Python
Redis
Terraform
Go
Bash
ML
Analytics
GCP
Google Cloud
GitHub Actions
Helm
ArgoCD
Pub/Sub
New Relic
OpenTelemetry
GitHub
Memcached
Caching
CI/CD
About this role
Role Overview
Design, implement, and manage scalable and reliable infrastructure for online inference services
Optimize Kubernetes-based deployments for low-latency model serving and real-time personalization
Automate CI/CD pipelines to streamline the deployment of ML models and services
Develop observability and monitoring solutions using tools like Prometheus, New Relic, and OpenTelemetry
Ensure high availability, security, and performance of real-time inference APIs
Work with ML engineers and backend teams to integrate inference models efficiently into production
Implement autoscaling strategies for inference workloads based on traffic patterns and model demand
Manage Pub/Sub and event-driven architectures to enable real-time messaging and engagement analytics
Optimize model-serving infrastructure using Redis, Memcached, and other caching strategies
Debug and resolve production issues related to latency, scaling, and reliability
Requirements
4+ years of experience in DevOps, Site Reliability Engineering (SRE), or Cloud Infrastructure Engineering
Solid experience with Kubernetes and container orchestration
Hands-on experience with CI/CD tools such as GitHub Actions, Jenkins, and ArgoCD
Experience working with real-time inference and ML model deployment
Deep knowledge of Google Cloud Platform (GCP), AWS, or Azure
Expertise in infrastructure as code (IaC) using Terraform or Helm
Experience with message queues and event-driven architectures (Pub/Sub, Kafka, etc.)
Proficiency in monitoring and logging solutions (New Relic, Prometheus, OpenTelemetry, etc.)
Strong scripting skills in Python, Bash, or Go for automation
Tech Stack
AWS
Azure
Cloud
Google Cloud Platform
Jenkins
Kafka
Kubernetes
Prometheus
Python
Redis
Terraform
Go
Benefits
Medical
Dental
Vision
401(k) plan
Life insurance coverage
Disability benefits
Tuition assistance program
Paid time off