Troubleshooting of database and infrastructure issues on services you provide
Manage capacity and availability to ensure resilience and cost effectiveness
Provide input into the design, development and implementation of automation and tooling for data engineering teams to achieve their goals.
Work closely with peers in data engineering teams to implement solutions that are scalable, secure, and easily maintained.
Disaster Recovery planning, testing and execution in combination with engineering teams
Design, implement and run resilient highly available solutions
Implement monitoring, health check scripts, functional and load testing
Use modern ‘infrastructure as code’ and orchestration tools to reduce manual operations where practically possible
Plan, organize and communicate project status
Share knowledge and skills with other team members
Requirements
Excellent knowledge of Public Cloud technology – we use AWS with RDS, S3, ES, API Gateway, Redshift, Lambda, SQS & EC2 being the core services we depend on.
Excellent knowledge of Database Technologies & Architectures DBA expertise supporting large scale databases such as SQL Server, Redshift or PostgreSQL; preferably data warehouses
AWS certified Engineer or equivalent commercial experience
Some experience developing scalable and resilient APIs, systems and services for cloud platforms
Strong understanding of techniques and strategies for maintaining high availability
Strong experience in automated deployment of infrastructure and applications
Strong scripting/programming skills in at least one language such as Python and PowerShell
Experienced in collaborative development – performing code reviews and automating via deployment pipelines, such as Jenkins, Gitlab CI or AWS CodePipeline
Excellent Windows & Linux systems knowledge and great application troubleshooting skills
Good knowledge of log, monitoring and alerting platforms such as AWS CloudWatch, Spotlight, ELK, Prometheus
Good understanding of Security and Compliance challenges and how to reduce scope and increase protection – we deal with PCI, SOX and GDPR on some of our systems
Technical writing skills for documenting environments and procedures
Self-motivated, energetic, and tenacious.
Able to work as part of a team as well as independently.
Strong organizational skills and time management.
A desire to learn and use a broad range of skills in a highly complex environment.
Excellent analytical, problem solving and resolution skills.
Passionate about automation and tooling.
Tech Stack
Amazon Redshift
AWS
Cloud
EC2
Firewalls
Jenkins
Linux
Postgres
Prometheus
Python
SQL
Benefits
Medical, Vision and Dental benefits for you and your family, including Flexible Spending Accounts (FSA) and Health Savings Accounts (HSAs)
Generous paid time off policy including paid holidays, sick time and paid days off for your birthday
Free concert tickets
401(k) program with company match
Stock Program
New parent programs & support including caregiver leave and childcare cash
Infertility support
Tuition reimbursement
Student loan repayment
Internal growth and development programs & trainings