Design and evolve distributed, resilient, and scalable database architectures on AWS, focusing on high availability and disaster recovery (DR).
Administer and perform deep tuning of MySQL/MariaDB engines and TiDB clusters, ensuring performance in high-concurrency scenarios.
Lead architectural design reviews with Product and Engineering teams, ensuring data design supports business growth.
Build robust automation in Python, Go, or Bash for infrastructure provisioning, maintenance, and self-healing, following Infrastructure as Code (IaC) practices.
Implement and maintain an advanced observability strategy that goes beyond basic monitoring to include distributed tracing and structured logs for rapid anomaly detection.
Mentor analysts and engineers, raising the team's technical bar in modern database and cloud practices.
Ensure data security by design, implementing hardening, encryption (in transit and at rest), access management (IAM following the least-privilege principle), and compliance with network policies.
Lead critical incident responses (War Rooms), conducting post-mortems focused on preventing recurrence.
Requirements
Strong expertise in MySQL/MariaDB: Deep knowledge of internal architecture (InnoDB), replication (GTID, Galera, Async/Semi-sync), partitioning, manual sharding, and troubleshooting complex performance issues.
Experience with CDC (Change Data Capture) tools or migrating monoliths to distributed architectures.
Ability to diagnose issues beyond CPU/RAM metrics by interpreting wait events, slow query logs, and distributed traces.
Hands-on experience with EC2 (instance families, EBS IOPS, Nitro), VPC (Peering, Transit Gateway, PrivateLink), S3 and advanced IAM in both managed and self-managed environments.
Solid knowledge of database hardening, encryption (KMS, TLS/SSL), secret management (Secrets Manager/Vault), and compliance (Security Hub).
Fluency in Bash and at least one robust scripting language (Python or Go) to create automations, internal CLIs and maintenance scripts.
Proven experience in cloud cost management (rightsizing, Spot/Graviton instances, and tagging strategies for cost allocation).
Experience with TiDB (or similar NewSQL) is a plus.
Knowledge of Infrastructure as Code (IaC), with experience in Terraform or similar for provisioning databases.
Experience operating databases inside Kubernetes is a big plus.
Development background: ability to read and debug application code (PHP, Node.js, or Go) to help developers refactor inefficient queries at the source.
Experience defining SLIs/SLOs/SLAs, Error Budgets and using modern monitoring tools (Prometheus, Grafana).
Tech Stack
AWS
EC2
Grafana
JavaScript
Kubernetes
MariaDB
MySQL
Node.js
PHP
Prometheus
Python
Terraform
Vault
Go
Benefits
Health insurance;
Dental insurance;
Meal or grocery allowance;
Childcare assistance;
Profit-sharing program (PPR);
Day off during your birthday month;
Life insurance;
Wellhub;
Férias&Co (travel benefit);
Zenklub (psychotherapy/nutritional counseling);
6-month maternity leave and 20-day paternity leave;
Flexible working hours;
Education and language course assistance.