Oracle is a technology leader that’s changing how the world does business, and they are seeking a Site Reliability Engineer DevOps to join their new Oracle Health organization. The role focuses on product deployment, sustainability, troubleshooting, and product strategy while ensuring reliability and performance in a cloud environment.
Responsibilities:
- Take ownership of the architecture, analysis, design, implementation and production operations of a wide array of Core System Framework solutions
- React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems
- Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support
- Partner with the distributed team in prototyping new platform services
- Stay informed of new technologies
- Innovate
- Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence
- Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services
- Develop designs, architectures, standards, and methods for large-scale distributed systems
- Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning and performance
Requirements:
- 3-5 years of experience as a Site Reliability or DevOps Engineer
- The ability to acquire & maintain a federal security clearance vital for this role, which requires you to be a US citizen
- Developing/operating large scale distributed services / applications
- Container administration and development applying Kubernetes, Docker, Mesos, or similar
- Infrastructure automation through Terraform, Chef, Ansible, Puppet, Packer or similar
- Experience with Cloud Orchestration frameworks, development and SRE support of these systems
- Experience with CI/CD pipelines including VCS (git, svn, etc), Gitlab Runners, Jenkins, Rundeck
- Working with or supporting production, test, and development environments for medium to large user environments
- Experience in developing scripts to automate software deployments and installations using PowerShell or Bash
- Knowledge of cloud compute technologies, network monitoring, data processing and analytics
- Experience with a modern programming language such as Java, Python, or C++ or equivalent
- Experience working with fault tolerant, highly available, high throughput, distributed, scalable systems
- Experience operating services in one of the major Clouds such as AWS, OCI, Azure, etc