Oracle is a leading company that powers innovations through data, infrastructure, and applications. As a Reliability Engineer, you will improve availability and reduce risk across mission-critical facility systems, utilizing data-driven analysis to enhance reliability at scale.
Responsibilities:
- Monitor and analyze operational telemetry, alarms, and performance trends to identify emerging risks and reliability degradation
- Define and track reliability KPIs; deliver concise analysis and recommendations that drive operational and engineering decisions
- Develop and maintain analytics and reporting tools using Python, SQL, and/or DCIM/BMS/SCADA data sources
- Support and/or lead RCAs and corrective action tracking for recurring or high-impact issues, ensuring follow-through and verification
- Partner with operations and engineering teams to improve preventive strategies, automation opportunities, and compliance execution
- Contribute to reliability standards and documentation that improve repeatability across sites
Requirements:
- Experience in reliability or systems analysis in data centers or other uptime-critical environments (utilities, telecom, manufacturing)
- Engineering degree or equivalent applied experience; comfort with data and tooling is required for this to be real
- Strong analytical and visualization skills; disciplined technical documentation
- Able to influence outcomes through evidence, clarity, and structured thinking