Teladoc Health is a leader in virtual care, dedicated to transforming how better health happens. The Manager, DevOps Engineering will lead a team to build and manage cloud infrastructure and application services, ensuring operational excellence and high availability.
Responsibilities:
- Work with engineering development leadership to build shared cloud infrastructure and application services that meet the requirements and needs of the commercial platform and application teams
- Ensure services are designed with 24/7 availability and operational maturity
- Lead, manage and inspire a team of DevOps engineers
- Build and maintain a highly effective release management system
- Assign and monitor the work of technical personnel, ensuring that application infrastructure development and deployment is done in the best possible way, and implement quality control and review systems throughout the development and deployment processes
- Conduct periodic reviews with key stakeholders
- Identify, evaluate, and execute preventive measures to minimize/avoid impact to the customer's experience
- Analyze and review application performance against the Technical Operations SLA
- Work with cross-functional business teams to understand requirements and other performance SLAs
- Collaborate with Product and Customer Support teams to plan and deploy product releases
- Implement proactive monitoring, alerting, trend analysis and self-healing systems
- Initiate and drive service improvement plans, collaborating with our SRE and NOC teams
- Manage analysis and approval of new code through security and performance gates that the position will design and develop for feature-complete software
- Be an advocate for security and performance standards in the organization
- Manage operational aspect of production and development servers including developing, training in, and validating compliance with procedures and checklists related to disk space usage, monitoring solutions, deployment, conventions, access to the production and development sources, source control access and usage, performance monitoring, code modifications validation, scheduling, and more
- Responsible for the team's adherence to ITIL and other audited regulatory compliance (HiTrust, FedRAMP) programs
- Update management in case of critical incidents and act as a point of contact for other related communications
- Work within IT and Technical Operations, business stakeholders and with vendors, to successfully identify, prioritize, and resolve issues and provide subject matter expertise for enhancements, developments, operational improvements to the website applications that Teladoc Health relies on
- Ensure system technical specifications, process, procedures, runbooks, SOPs, and infrastructure requirements definitions based on conceptual design are maintained and high-quality for all technical and non-technical deliverables to a wider organizational audience and for T3-4 level support of production workloads
- Ensure the development of our SaaS public and private cloud infrastructure is conducted in cost-conscious ways, using 'infrastructure as code' and automated environment provisioning tools and techniques
- Participate in the full software and infrastructure development life cycle process including requirements analysis and design implementation
- Consult with the Product Management team to prototype, refine, test, and ship products to meet business needs
- Consult and participate in implementation of new customer features, products, and utilities
- Identify and evaluate new technologies for implementation
- Analyze user needs to determine technical requirements
- Collaborate with the Operational, Development and Architectural teams to ensure operational maturity requirements (reliability, availability, scalability, observability, performance, capacity etc.) are met, and recommend operational improvements to them
- Ensure internally developed and externally acquired custom off the shelf solutions are appropriately instrumented and monitored via the Teladoc monitoring standards
- Perform managerial work to accomplish tasks and projects within defined timelines and in a professional manner in alignment with active standards
Requirements:
- Bachelor's degree, or foreign equivalent in Computer Engineering, Electrical Engineering or a related field
- 8 years of experience as a Technology Lead, Azure Infrastructure Consultant, Microsoft Build Master, Lead Engineer, or other occupation involving Software within the Service (SaaS) industry
- Professional experience with Version management and ticketing systems such as Git and Jira
- Professional experience with Deploying, operating and troubleshooting web application software on Unix/Linux systems
- Professional experience with Continuous integration, testing and deployment with tools such as Jenkins, Azure DevOps, Bamboo CI, etc
- Professional experience with Working with large scale infrastructure in AWS or Azure public cloud
- Professional experience with Designing systems with high availability and disaster recovery
- Professional experience with Monitoring, metrics and visualization tools such as New Relic, Sensu, Nagios, etc
- Professional experience with Unix/Linux system administration and troubleshooting
- Master's degree, or foreign equivalent in Computer Engineering, Electrical Engineering or a related field
- 6 years of experience as a Technology Lead, Azure Infrastructure Consultant, Microsoft Build Master, Lead Engineer, or other occupation involving Software within the Service (SaaS) industry