Home
Jobs
Saved
Resumes
JobVerse
Home
Jobs
Recruiters
Companies
Pricing
Blog
Jobs
/
Site Reliability Engineer – Operations
SS&C Technologies
Website
LinkedIn
Site Reliability Engineer – Operations
Kansas City, Florida, United States of America
Full Time
2 hours ago
No H1B
Apply Now
Key skills
Ansible
AWS
Azure
Cloud
DNS
Google Cloud Platform
Grafana
Linux
Prometheus
Puppet
Python
Shell Scripting
Splunk
Terraform
Unix
VMware
Go
C
Shell
Bash
GCP
Google Cloud
Datadog
Load Balancing
CI/CD
About this role
Role Overview
Maintain and improve the uptime, performance, and availability of production systems.
Define and track SLIs , SLOs , and SLAs to ensure service reliability and user satisfaction.
Implement and manage monitoring, alerting, and observability tools (e.g., Prometheus, Grafana, Datadog, ELK).
Participate in on-call rotations and respond to incidents, performing root cause analysis and postmortems.
Automate repetitive tasks and processes using scripts, configuration management, and Infrastructure as Code (IaaC).
Develop CI/CD pipelines to streamline deployment and operational processes.
Analyze system performance and capacity trends to plan for future growth.
Collaborate with engineering teams to design systems that scale reliably.
Support cloud and/or hybrid infrastructure (AWS, Azure, GCP, VMware, etc.).
Manage system provisioning, configuration, and patching via tools such as Ansible, Terraform, or Puppet.
Act as a bridge between development and operations teams, championing DevOps and SRE principles.
Contribute to a culture of continuous improvement, reliability, and accountability.
Requirements
Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
3+ years of experience in a Site Reliability, DevOps, or Systems Engineering role.
Experience with Linux/Unix systems , Windows , shell scripting, and administration.
Proficiency in at least one programming/scripting language (Python, Go, Bash, etc.).
Hands-on experience with cloud platforms ( AWS , Azure , or GCP ).
Strong knowledge of networking, security, load balancing, and DNS.
Experience with monitoring/logging tools (e.g., Prometheus, Grafana, ELK, Splunk, Datadog).
Tech Stack
Ansible
AWS
Azure
Cloud
DNS
Google Cloud Platform
Grafana
Linux
Prometheus
Puppet
Python
Shell Scripting
Splunk
Terraform
Unix
VMware
Go
Benefits
Flexibility : Hybrid Work Model & a Business Casual Dress Code, including jeans
Your Future: 401k Matching Program, Professional Development Reimbursement
Work/Life Balance: Flexible Personal/Vacation Time Off, Sick Leave, Paid Holidays
Your Wellbeing: Medical, Dental, Vision, Employee Assistance Program, Parental Leave
Diversity & Inclusion: Committed to Welcoming, Celebrating and Thriving on Diversity
Training: Hands-On, Team-Customized, including SS&C University
Extra Perks: Discounts on fitness clubs, travel and more!
Apply Now
Home
Jobs
Saved
Resumes
Site Reliability Engineer – Operations at SS&C Technologies | JobVerse