Motive is a software solutions company seeking a Managed Services Site Reliability Engineer to deploy, operate, and maintain customer applications across Linux bare metal servers and Red Hat OpenShift platforms. The role focuses on application deployment, release management, and operational support in a live production environment, while ensuring systems meet SLA requirements and improving reliability and automation practices.
Responsibilities:
- Deploy, manage, and maintain applications on Linux bare metal servers and OpenShift/Kubernetes clusters
- Execute CI/CD pipelines and ensure reliable, repeatable releases across hybrid environments
- Build and maintain observability for deployed applications using Prometheus, Grafana, Zabbix
- Implement and maintain centralized logging solutions using Grafana Loki, OpenSearch/Elasticsearch, Fluentd/Fluent Bit
- Develop automation scripts to streamline deployments and reduce operational toil (Bash, Python, JavaScript)
- Participate in incident response and troubleshoot application or platform issues in a live production environment
- Support night-time deployments and carry pager on rotation; respond to emergencies, including weekends and holidays
- Collaborate with internal teams to continuously improve deployment reliability and efficiency
- Learn new technologies, take direction, and develop skills as needed
Requirements:
- Minimum 5 years Linux system administration experience
- Minimum 5 years Kubernetes (K8s) experience
- Exposure to RedHat OpenShift
- Experience with application servers such as JBoss or WebLogic
- Experience with monitoring tools: Zabbix, Prometheus, Grafana
- Experience with logging pipelines: Elasticsearch, Logstash, Kibana (ELK)
- Exposure to web servers – Apache, Nginx
- Experience with Ansible
- Basic networking skills
- Basic SQL skills
- Strong troubleshooting skills and ability to operate in a live production environment
- Willingness to carry pager for on-call rotations (typically a week at a time)
- Willingness to support night-time deployments
- Willingness to work off-hours, including weekends and holidays in emergencies
- Willingness to learn on the fly and develop new skills