Mount Laurel, New Jersey, United States of America
Full Time
3 hours ago
$200,000 - $280,000 USD
Visa Sponsor
Key skills
SplunkAIDatadogDynatraceAgile
About this role
Role Overview
Define and execute the enterprise strategy for Observability, AIOps, SRE, and incident intelligence.
Lead global teams responsible for the platforms, practices, and product roadmap that enable 24x7 operational visibility and resilience across critical technology services.
Drive the next phase of transformation from reactive monitoring to predictive, AI-assisted operations by advancing telemetry standards, improving signal quality, and scaling automation across the incident lifecycle.
Partner closely with engineering, platform, and application teams to embed observability into design, build, and run practices, while also leading vendor strategy, platform rationalization, and cost optimization.
Build trusted partnerships across engineering, infrastructure, architecture, risk, and control functions to deliver secure, stable, and scalable operational outcomes.
Establish governance and guardrails for AI.
Improve service availability and reduce customer-impacting outages.
Enable faster recovery and proactive issue prevention.
Enhance reliability of digital and core banking platforms.
Requirements
Undergraduate degree or Technical Certificate
15+ years of development and technology delivery experience with Agile Delivery Experience preferred
Strong experience with APM, Event management, Operational Automation and Reporting Platforms Including Dynatrace, Datadog, Splunk, PagerDuty, RunDeck, PowerBI.
Demonstrated ability to influence senior stakeholders, lead through ambiguity, align teams to measurable outcomes, and create a culture of accountability, innovation, and continuous improvement.
Tech Stack
Splunk
Benefits
Health and well-being benefits
Savings and retirement programs
Paid time off (including Vacation PTO, Flex PTO, and Holiday PTO)