The Home Depot is a leading home improvement retailer seeking a Senior Systems Engineer to develop, maintain, and support its technical infrastructure. This role involves collaborating with product and project teams, supporting technology architecture design, and ensuring the operational stability of systems and infrastructure.
Responsibilities:
- Keeps abreast of innovations and industry trends as well as changes to internal systems and determines how they impact the tools, training, and support necessary to keep systems up, running, and secure; Participates in and contributes to learning activities around modern systems engineering core practices (communities of practice); Proactively views articles, tutorials, and videos to learn about new technologies and best practices being used within other technology organizations
- Researches and analyzes business trends and behavioral data to identify opportunities for improvements and new initiatives; Drives the evaluation, development, and recommendation of specific technology to provide cost-effective solutions that meet THD requirements; Researches and designs best-fit infrastructure, network, database, cloud, AI, and security architectures for products; Proactively creates and maintains tools for monitoring and support; Participates in project planning and reporting across multiple efforts
- Collaborates with product and project teams to understand their needs and provide the infrastructure they require; Supports technology architecture design review efforts for project and product teams; Leverages tooling and custom applications to monitor the operational status of applications, infrastructure, networks, databases, and security; Optimizes and tunes performance as appropriate; Drives root cause analysis, debugging, support, and post-mortem analysis for security incidents and service interruptions; Maintains, upgrades, and supports existing systems and infrastructure to ensure operational stability; Opens and manages vendor problem tickets to resolution; Drives the production of in-house documentation around solutions; Provides application support for software running in production; Drives the migration of KB articles to infrastructure-as-code models; Keeps monitoring and alerting up to date
Requirements:
- Must be eighteen years of age or older
- Must be legally permitted to work in the United States
- The knowledge, skills and abilities typically acquired through the completion of a bachelor's degree program or equivalent degree in a field of study related to the job
- 4 years of work experience
- Professional or educational experience as an Information Technology Engineer
- Experience working as part of a collaborative, cross-functional, modern engineering team
- Experience in troubleshooting and remediation within multiple information technology disciplines
- Experience installing and upgrading applications or databases and performing system maintenance
- Familiarity with system and environment analysis, design, and optimization
- Familiarity with debuggers, runtime analysis, library systems, compiled programming, and software update tools
- Experience monitoring the operational status and performance of, and configuring as well as tuning, systems, networks, or databases
- Experience with operating system commands and utilities as well as scripting
- Experience with cloud platforms such as GCP and Azure
- Experience supporting a 24x7 retail operation
- Experience with version control systems
- Experience with CI/CD toolchains
- Experience with production system designs including Infrastructure as Code, High Availability, and Performance Monitoring
- Exposure to Site Reliability Engineering (SRE)