Own and architect high-availability MySQL database platforms supporting critical business systems.
Design and implement multi-region replication strategies, disaster recovery architectures, and zero-downtime migration patterns.
Lead incident response for critical database outages, coordinate cross-functional teams, and drive post-incident reviews.
Define and track database SLIs/SLOs, establish reliability metrics, and implement continuous improvement programs.
Drive capacity planning at scale, forecast growth patterns, and architect for horizontal and vertical scaling.
Establish database standards, operational runbooks, and best practices across the organization.
Lead automation initiatives to eliminate toil and improve operational efficiency.
Design and implement database provisioning platforms using Terraform and configuration management tools.
Architect and implement GitOps-driven database deployments integrated with CI/CD pipelines.
Develop custom tooling and automation frameworks for database operations (Python, Go, or Bash).
Requirements
8+ years of hands-on MySQL DBA/Engineering experience in production environments.
Expert-level knowledge of MySQL architecture, replication (async, semi-sync, GTID), performance tuning, and troubleshooting.
4+ years managing Cloud SQL on GCP, including HA configurations, read replicas, and backup strategies.
4+ years of Infrastructure as Code experience (Terraform strongly preferred), including module development and state management.
4+ years with configuration management tools (Ansible, Chef, or Puppet) in production environments.
Proven experience building and maintaining CI/CD pipelines for database changes and schema migrations.
Strong programming/scripting skills (Python, Bash, or Go) for automation and tooling development.
Deep Linux system administration skills and understanding of OS-level performance tuning.
Experience with monitoring and observability tools (Prometheus, Grafana, ELK, or similar).
Demonstrated leadership in incident response, including on-call rotations and post-mortem facilitation.
Experience supporting large-scale production systems with strict uptime requirements (99.9%+).
Tech Stack
Ansible
Chef
Cloud
Google Cloud Platform
Grafana
Linux
MySQL
Prometheus
Puppet
Python
SQL
Terraform
Go
Benefits
Personal time off 15 days
Casual (6) days & Sick Days (6)
Medical Insurance: 8 Lakhs Family Floater that covers employee, spouse, 3 children and parents or In-laws
Group Personal Accident: 3x of the employee’s Gross Salary
Group Term Life: 3x of the employee’s Gross Salary
Volunteering Days – Use them to make a difference in your community. Whether it's a cleanup, supporting a local initiative or holding an “Ehrenamt”, just let us know! We're here to help you create a meaningful impact.
Employee Stock Program (ESPP): Participate in our Employee Stock Participation Program and receive a discount on company shares, allowing you to share in LivePerson's success.
Learning & Development: We actively support your professional journey with robust programs for growth and learning development, including allocated stipends, ensuring you reach your full potential.