iSpot is a competitive startup seeking an experienced Senior Site Reliability Engineer with a focus on database management and infrastructure optimization. The role involves maintaining and scaling database systems, ensuring high performance and security, while collaborating with cross-functional teams to meet organizational needs.
Responsibilities:
- Develop and enforce database standards, best practices, and security protocols
- Migrate databases across accounts and regions, prioritizing availability and low latency
- Monitor and analyze database performance, identifying bottlenecks and implementing solutions to improve efficiency
- Assist with database caching, indexing, and resource allocation
- Administer and optimize environments including user access, roles, and cost controls
- Implement and manage backup and recovery strategies where applicable
- Ensure database systems are highly available and resilient, including configuring replication and clustering as needed
- Perform routine maintenance tasks such as patching, upgrades, and storage management
- Automate database operations using scripts and tools
- Implement monitoring solutions to proactively detect and resolve database issues
- Ensure compliance with data security policies and regulatory requirements
- Track cost/usage by team
- Provide guidance and mentorship to junior team members
- Collaborate with software developers, data engineers, and analysts to align database solutions with business needs
- Participate in capacity planning, budgeting, and strategic decision-making for database technologies
- Collaborate with data engineers and analytics teams to assist with ELT/ETL pipelines
- Migrate the data from a Redshift cluster to Snowflake
- Monitor usage and implement cost governance policies for Snowflake workloads
Requirements:
- Bachelor's degree in Computer Science, Information Technology, or a related field; equivalent experience will also be considered
- 7+ years of experience managing databases, including at least 3 years specializing in MySQL, Snowflake, and Redshift Spectrum
- Proficiency in database administration, including replication, clustering, and high availability
- Demonstrated experience running complex database migrations, which may have spanned multiple teams, accounts, and/or regions
- Expertise in MySQL, Snowflake, Amazon Redshift and Redshift Spectrum, including data lake integration
- Understanding of SQL and database internals, including storage engines and query execution plans
- Experience with data pipeline tools (e.g., Apache Airflow, AWS Glue)
- Knowledge of monitoring tools (e.g., CloudWatch, Prometheus) and scripting languages (e.g., Python, Bash)
- Familiarity with cloud infrastructure, particularly AWS (S3, Lambda, IAM, etc.)
- Excellent problem-solving and analytical skills
- Strong communication and collaboration abilities
- Ability to work in a fast-paced, dynamic environment and manage multiple priorities
- AWS certifications (e.g., AWS Certified Solutions Architect, AWS Certified Database - Specialty)
- Experience with big data technologies such as Hadoop, Hive, or Spark
- Familiarity with DevOps practices, including infrastructure as code (Terraform) and CI/CD pipelines for database deployments and user management
- Familiarity with ClickHouse