NetSuite is a company that specializes in cloud solutions and is part of Oracle. They are seeking a Senior Site Reliability Engineer to manage and ensure the reliability of Oracle databases and Exadata infrastructure, focusing on automation, operational improvements, and support for migrations to Oracle Cloud.
Responsibilities:
- Support day-to-day operations of Oracle databases and Exadata (Prod and Non-Prod), including incident response and on-call support as needed (in alignment with local regulations)
- Triage database alerts and issues; perform deep-dive troubleshooting, root cause analysis, and implement corrective/preventative actions
- Perform capacity management, performance analysis, and reliability improvements for database platforms
- Maintain and support Non-Production and Production Standby environments and associated replication/high availability capabilities (e.g., Data Guard; GoldenGate as applicable)
- Develop and improve automation, tooling, and scripts to reduce toil and improve operational consistency
- Create and maintain documentation including runbooks, standard operating procedures, and knowledge-transfer materials
- Define and mature in-house standards, policies, and best practices aligned with industry standards, cyber requirements, and Oracle Maximum Availability Architecture (MAA) principles
- Mentor teammates and facilitate communication with leadership, clients, and cross-functional engineering partners
- Contribute to roadmap projects, including migration planning/execution for OCI and Autonomous Database
- Demonstrate practical experience using AI-assisted techniques/tools to improve developer productivity and quality (e.g., faster prototyping, stronger test coverage, safer refactoring, better documentation)
- Apply an AI-first mindset to day-to-day work: generating and validating code suggestions, creating/maintaining tests, and improving observability and runbooks—while maintaining strong engineering judgment
- Understand and follow enterprise security and privacy requirements when using AI tooling (e.g., protect sensitive data, use approved tools/workflows)
Requirements:
- 6+ years of experience as an Oracle DBA, Site Reliability Engineer, or Oracle Database Architect
- 6+ years of experience managing scalable on-prem and/or cloud-native distributed systems
- Bachelor's degree or Master's degree in Information Technology, Computer Science, Mathematics, Computer Engineering, or related field (or equivalent practical experience)
- Ability to work effectively in a collaborative, cross-functional environment
- Strong grasp of core Computer Science concepts
- Hands-on experience with PL/SQL and Python, Perl, and/or Shell scripting
- Experience supporting production databases running on Exadata
- A minimum of 5+ years experience of running large scale customer facing web services
- Oracle Maximum Availability Architecture (MAA) and Exadata best practices
- High availability & replication technologies (e.g., Data Guard, GoldenGate)
- Advanced scripting/coding and automation engineering (Shell/Perl/Python)
- Advanced compression
- Security Technical Implementation Guides (STIGs) and secure operations practices
- Oracle Autonomous Database experience