Centene Corporation is a diversified, national organization that aims to improve health outcomes through technology. The Principal Site Reliability Engineer will lead complex projects focused on managing and maintaining platform infrastructure performance, reliability, and security, utilizing SRE practices and tools.
Responsibilities:
- Uses advanced experience to lead more complex projects from end-to-end that are focused on managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools, manual and automated procedures, documentation, people and processes and continuous delivery(CI/CD) tools, processes, and designs
- Leads the development and delivery of complex services to automate monitoring activities and provide critical information to facilitate response and resolution of performance and availability issues and incidents
- Leads the delivery of standardized and scalable software tools to ensure that systems operate without interruption at optimum performance and leads project teams throughout the deployment process
- Troubleshoots and analyzes service disruptions to determine the root cause of issues and develop solutions for improved reliability
- Designs architectures and creates software to improve the availability, scalability, and efficiency of the service at very large scale
- Acts as the subject matter expert for building and maintaining observability/monitoring for the application, monitoring key performance indicators, maintaining alerting, and continuously improving visibility
- Evaluates and improves the security, monitoring & reliability of the deployed systems
- Designs and implements solutions to identify strategies that increase system reliability and performance through on-call rotation and process optimization
- Authors and maintains technical reviews and documents findings for future informed decision making
- Designs and implements diagnostics infrastructure framework to improve product quality
- Owns, triages, investigates, and resolves service issues with emphasis on broad communications, learning, and teaching throughout the process
- Defines and drives change management, continuous integration, and deployment best practices
- Mentors, develops and delivers training to the engineering team on new systems, protocols, and best practices
- Drives and coaches others through reviews of design, code, and test cases
- Performs other duties as assigned
- Complies with all policies and standards
Requirements:
- A Bachelor's degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science)
- Deep functional and Centene specific knowledge with 6 – 8 years of related experience
- Experience with Linux Operating System; Operating Systems; Unix Operating System; Windows Operating System
- Experience with observability/monitoring tools such as Splunk, Dynatrace, Elastic, New Relic, Prometheus, Grafana
- Experience with enterprise level CICD Tools such as Ansible, Jenkins, Cloudbees, OpenShift
- Experience working in public cloud platforms like AWS and Azure
- Experience with Programming Tools
- Experience building and operating highly scaled applications
- Experience with MongoDB; MySQL; Oracle Database Management System (DBMS); PL SQL; SQL (Programming Language)
- Experience with varying code repositories, auto deployments, branching with tools such as Gitlab, Bitbucket, Subversion
- Experience with IT service management tools such as Service Now, Atlassian, BMC
- Advanced - Seeks to acquire knowledge in area of specialty
- Advanced - Ability to identify basic problems and procedural irregularities, collect data, establish facts, and draw valid conclusions
- Advanced - Ability to work independently
- Advanced - Demonstrated analytical skills
- Advanced - Demonstrated project management skills
- Advanced - Demonstrates a high level of accuracy, even under pressure
- Advanced - Demonstrates excellent judgment and decision making skills
- Advanced - Ability to communicate and make recommendations to upper management
- Advanced - Ability to drive multiple projects to successful completion
- Advanced - Possesses technical aptitude