Apply software engineering techniques, automation, and best practices in incident response
Ensure the reliability, availability, and scalability of the systems, platforms, and technology
Maintain the availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning
Respond to, analyse, and resolve system outages and disruptions
Implement measures to prevent similar incidents from recurring
Develop tools and scripts to automate operational processes
Monitor and optimise system performance and resource usage
Collaborate with development teams to integrate best practices for reliability, scalability, and performance
Stay informed of industry technology trends and innovations
Engage in complex analysis of data from multiple sources to solve problems creatively and effectively
Requirements
5+ years of MongoDB administration in production environments, preferably in financial services
Deep expertise in MongoDB architecture: replica sets, sharding, backup/recovery strategies, and disaster recovery
Performance tuning and optimisation: query analysis, indexing strategies, and capacity planning
Proficiency in MongoDB shell, JavaScript scripting, and aggregation pipelines
Strong troubleshooting skills for production incidents and performance degradation
Security best practices: authentication, encryption at rest/in transit, audit logging
Scripting expertise in Python and/or Bash for automation and operational tasks
CI/CD pipeline development and maintenance
Version control with Git and collaborative development practices
Database migration and upgrade strategies with zero/minimal downtime
Experience with observability platforms: Prometheus, Grafana, ELK/EFK stack, or similar
Incident management, root cause analysis, and post-mortem documentation
On-call rotation experience and production support
Strong communication skills for cross-functional collaboration
Proactive problem-solving and ownership mentality
Documentation and knowledge-sharing practices
Some other highly valued skills may include:
Percona Server for MongoDB or MongoDB Enterprise experience
API development with FastAPI, Flask, or similar frameworks
Infrastructure as Code (IaC) using Terraform, Ansible, or Chef
Container orchestration with Kubernetes and Docker