Ansible, AWS, Azure, Cloud, Distributed Systems, Docker, Java, Jenkins, JMeter, JUnit, Kubernetes, Maven, Python, Splunk, ECS, EKS, AKS, CloudWatch, Datadog, ELK Stack, GitHub, CI/CD, Leadership, Remote Work
About this role
Role Overview
Support the resilience and observability of Workplace Investment (WI) applications across pre-production and non-production test environments
Drive monitoring initiatives across WI product lines, including Health Care (HC) and Stock Plan Services (SPS)
Collaborate with development, engineering, and operations teams to identify and address root causes
Report regularly to leadership and partner with Enterprise Infrastructure (EI) to align monitoring practices with production standards
Requirements
Bachelor's or master's degree in computer science, software engineering, or a related technical field
9+ years of experience supporting and operating complex, large-scale production environments
Deep technical expertise and extensive hands-on experience with monitoring and observability platforms, including CloudWatch, Datadog, Splunk, and the ELK stack
Strong proficiency in distributed systems and technologies, including Java, Python, and cloud platforms such as AWS and/or Azure
Experience working with both batch processing and online transaction processing (OLTP) applications
Proven hands-on experience with CI/CD and DevOps tooling, including GitHub, Jenkins, Concourse, Ansible, Maven, JUnit, Docker, JMeter, Artifactory, Sonar, Veracode, Kubernetes, and UDeploy
Practical experience with container orchestration platforms, including Kubernetes, AKS, EKS, and ECS