IT Project Manager / Observability
Should have used Dynatrace Observability
Remote
Exp: 12+yrs
Note:
- Lead operationalization and migration efforts for Dynatrace-related initiatives
- Identify and define opportunities for process automation
- Develop, audit, and maintain runbooks for long-term scalability
- Implement tools and establish sustainable operating processes
- Partner across teams to drive communication, adoption, and execution
- Experience in supporting organizational transformation tied to tool migration and process maturity
- Bring strong IT background; Dynatrace experience is a plus
- Focus on setting up solutions that are effective and sustainable long term
Experience leading agent/platform upgrades across Linux, Windows, and Kubernetes
Project Leadership: own plans, RAID logs, executive communication
Middle piece between engineers, app owners, and leadership
Required Experience:
- Strong background in Project/Program Management within Infrastructure, Operations, or SRE environments
- Experience supporting incident management and/or change management processes (ITSM/CMDB practices)
- Familiarity with observability concepts and tooling (e.g., Dynatrace, BigPanda)
- Proven ability to operate at a strategic level, connecting initiatives rather than owning deep technical execution
-Experience managing multiple concurrent workstreams in fast-paced enterprise environments
- Exposure to enterprise monitoring and event management platforms
- Experience driving initiatives focused on uptime, reliability, and zero-downtime environments
Job Description
Ideal candidate Seeking a hands-on Technical Project Manager III to lead Dynatrace OneAgent upgrade work and associated Global Settings workstreams across United’s cloud and on-prem environments. This role will be responsible for coordinating cross-functional stakeholders (application owners, platform teams, Dynatrace vendor partners), driving the project into production readiness, and operationalizing a repeatable, low-risk cadence for future upgrade cycles and tenant/global configuration changes.
Job Summary / What you’ll do This PM III will manage two closely related initiatives:
A. Dynatrace OneAgent Upgrade (primary): Lead the end-to-end upgrade project for Dynatrace OneAgent (Linux & Windows hosts and Kubernetes Operator paths). Pre-upgrade testing and validation will be completed prior to the PM’s start so the PM can hit the ground running on Day One. Responsibilities include owning the project plan and phased execution (validated staging → controlled production rollout → validation and operationalization); coordinating application teams; orchestrating scheduled patch/reboot windows; validating telemetry and instrumentation post-upgrade; and finalizing runbooks and handoff artifacts for operations.
B. Dynatrace Global Settings (Platform Enablement): Lead planning and execution for updating global Dynatrace settings across tenants (lower tenants → PROD). The global configuration change itself is executed one time per tenant and is typically completed quickly; the substantive work is coordination: scheduling and tracking required server/host restarts by application teams over designated change nights, communicating windows and risks, collecting application-team confirmations or exception reports, and performing post-restart validation. The PM must manage communications, capture reboot/participation confirmations, and track/triage any post-change issues.
Key responsibilities
· Independently run the OneAgent Upgrade project: phased rollout, automated and manual validation, issue triage, rollback execution and remediation.
· Lead Global Settings delivery: sequence changes by tenant, define release windows, validate minimal production impact, and produce an enterprise standard for global configuration management and minimal customization.
· Maintain artifacts: project plans, RAID, dependency trackers, decision logs, runbooks and SteerCo reporting.
· Communicate to application teams, platform engineering, support, and executive stakeholders; prepare concise SteerCo materials and communicate risks early.
· Deliver operational handoff: validated runbooks, acceptance criteria, automation scripts or configuration-as-code where applicable, and a repeatable cadence for future upgrades and global settings updates.
· Tracking & reporting (critical): own project-level tracking and reporting artifacts — integrated status (weekly), an operational readiness dashboard, RAID logs and dependency trackers — and provide daily status reporting during change windows. Produce SteerCo-ready materials and a 24-hour post-change validation report following each major change window.
Qualifications / Experience (non-cumulative) • 3+ years in technical project management; demonstrable experience leading observability/monitoring platform upgrades or equivalent platform change programs. • Demonstrated experience delivering enterprise upgrades across Linux, Windows and Kubernetes (including Operator upgrades). • Strong competency in change and release management for production systems (patch cycles, planned host restarts, change windows).
- Manage and coordinate multiple initiatives across incident management and observability programs
- Provide big-picture oversight across projects, identifying dependencies and ensuring alignment between parallel efforts
- Support ongoing and upcoming project phases
- Partner with internal stakeholders to connect workstreams and highlight interdependencies across teams
- Drive improvements tied to MTTR (Mean Time to Resolution), change success rates, and overall system reliability
- Facilitate communication, status reporting, and executive-level updates
- Help stand up new workstreams and close out completed efforts