Oracle is a leading company in AI and cloud solutions that impact billions of lives. As a Site Reliability Engineer, you will be responsible for managing and ensuring the reliability of Oracle databases and Exadata infrastructure, including troubleshooting, capacity management, and driving automation improvements.
Responsibilities:
- Support day-to-day operations of Oracle databases and Exadata (Prod and Non-Prod), including incident response and on-call support as needed (in alignment with local regulations)
- Triage database alerts and issues; perform deep-dive troubleshooting, root cause analysis, and implement corrective/preventative actions
- Perform capacity management, performance analysis, and reliability improvements for database platforms
- Maintain and support Non-Production and Production Standby environments and associated replication/high availability capabilities (e.g., Data Guard; GoldenGate as applicable)
- Develop and improve automation, tooling, and scripts to reduce toil and improve operational consistency
- Create and maintain documentation including runbooks, standard operating procedures, and knowledge-transfer materials
- Define and mature in-house standards, policies, and best practices aligned with industry standards, cyber requirements, and Oracle Maximum Availability Architecture (MAA) principles
- Mentor teammates and facilitate communication with leadership, clients, and cross-functional engineering partners
- Contribute to roadmap projects, including migration planning/execution for OCI and Autonomous Database
- Demonstrate practical experience using AI-assisted techniques/tools to improve developer productivity and quality (e.g., faster prototyping, stronger test coverage, safer refactoring, better documentation)
- Apply an AI-first mindset to day-to-day work: generating and validating code suggestions, creating/maintaining tests, and improving observability and runbooks—while maintaining strong engineering judgment
- Understand and follow enterprise security and privacy requirements when using AI tooling (e.g., protect sensitive data, use approved tools/workflows)
Requirements:
- 6+ years of experience as an Oracle DBA, Site Reliability Engineer, or Oracle Database Architect
- 6+ years of experience managing scalable on-prem and/or cloud-native distributed systems
- Bachelor's degree or Master's degree in Information Technology, Computer Science, Mathematics, Computer Engineering, or related field (or equivalent practical experience)
- Ability to work effectively in a collaborative, cross-functional environment
- Strong grasp of core Computer Science concepts
- Hands-on experience with PL/SQL and Python, Perl, and/or Shell scripting
- Experience supporting production databases running on Exadata
- Oracle Database administration and operations
- Oracle Grid Infrastructure, ASM & RAC
- Oracle Cloud (OCI) fundamentals (migration and/or operations)
- Scripting/automation (PL/SQL, shell, Python/Perl)
- Observability and operational readiness (monitoring/alerting, runbooks, incident response)
- A BS or MS in Computer Science, or equivalent
- Identifies and implements complex solutions to knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, technology security and compliance
- Experience running large scale customer facing web services
- Identifies and implements complex solutions to understanding of load balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies
- Work involves defining and documenting technical architecture of complex and highly scalable products
- A minimum of 8+ years experience of running large scale customer facing web services
- Oracle Maximum Availability Architecture (MAA) and Exadata best practices
- High availability & replication technologies (e.g., Data Guard, GoldenGate)
- Advanced scripting/coding and automation engineering (Shell/Perl/Python)
- Advanced compression
- Security Technical Implementation Guides (STIGs) and secure operations practices
- Oracle Autonomous Database experience