Oracle is a leading company that brings together data, infrastructure, applications, and expertise to power industry innovations. They are seeking a Senior Site Reliability Engineer / DevOps to manage daily operational tasks for their CareAware Cloud SaaS, ensuring server performance and compliance with Service Level Agreements while contributing to cloud build-out and client migrations.
Responsibilities:
- As a member of the RTHS DevOps team, you will be responsible for the daily operational tasks required to run the CareAware Cloud SaaS for all our cloud clients
- You will monitor and maintain server performance and availability, and ensure compliance with Service Level Agreements
- You will address operational systems issues as needed
- You will deploy new code, onboard new clients or new solutions, and complete technology upgrades
- As we move into future projects, the team has critical involvement in our OCI cloud build-out and client migrations, giving you an opportunity to get involved in these new regions from the ground up and apply DevOps thinking from the beginning
- This role is expected to help define future Kubernetes architecture decisions and the automation needed to operationalize and simplify management of numerous clusters
Requirements:
- Deep Linux knowledge
- Deep Kubernetes knowledge
- System monitoring and troubleshooting
- Network monitoring and troubleshooting
- Cloud experience in OCI or AWS
- US Citizenship is a requirement for this role