Lead the architecture, management, and optimization of Oracle 19c RAC, MySQL 8, and PostgreSQL database systems across on-premises, AWS, and OCI cloud environments.
Define and drive database standards, best practices, and operational runbooks across the organization.
Own database upgrade and migration strategies for Oracle and MySQL, ensuring minimal downtime and risk mitigation across all environments.
Lead advanced database troubleshooting, root cause analysis, and performance engineering across Oracle, MySQL, PostgreSQL, HAProxy, and Keepalived.
Architect and maintain database replication topologies including master-slave, master-master, and cross-region configurations for disaster recovery.
Design and implement automation for database provisioning, patching, scaling, and failover using Python, Terraform, and Ansible.
Own backup and recovery strategy using RMAN, mysqldump, pg_dump, and point-in-time recovery, ensuring RPO/RTO targets are met and regularly tested.
Implement and maintain database observability including monitoring, alerting, dashboards, and capacity forecasting using Grafana, CloudWatch, or equivalent platforms.
Collaborate with application development teams on schema design, query optimization, indexing strategies, and capacity planning.
Mentor and guide junior database engineers, conducting code and architecture reviews.
Participate in on-call rotations as a senior escalation point for critical database incidents.
Evaluate emerging database technologies and make recommendations for adoption where appropriate.
Requirements
7+ years of hands-on experience in database engineering or DBA roles, with at least 3 years in a senior or lead capacity.
Expert-level knowledge of Oracle 19c RAC administration, including RMAN, Data Guard, patching, and advanced query optimization.
Deep experience with MySQL 8 administration, replication architectures (master-slave, master-master, GTID-based), and performance tuning at scale.
Strong working knowledge of PostgreSQL administration, replication, and performance tooling.
Proven experience designing and operating high-availability and disaster recovery database architectures in production.
Strong proficiency in Linux systems administration (Ubuntu/RHEL-based) and shell scripting.
Extensive experience with automation and infrastructure-as-code: Python, Terraform, Ansible.
Solid understanding of high-availability components: HAProxy, Keepalived, or equivalent load balancing and failover solutions.
Experience with Git-based workflows, CI/CD practices, and database change management processes.
Demonstrated ability to lead incident response, perform root cause analysis, and drive post-incident improvements.
Excellent communication skills with the ability to influence technical decisions and collaborate effectively across time zones in a distributed team.