Job Description: Observability Engineer
SRE background required, with AWS and Python/Java. Expertise in observability tools such as Splunk, New Relic, and Observe is a must-have. Targeting 4-6 months of experience with Observe and at least 4-5 years of overall SRE experience.
The role involves journey mapping on DFS intent.
Most of the work (60-70%) is building dashboards and collaborating with the team to improve them, understanding data sets, and recommending improvements and data import strategies. This includes data modifications and cleanups.
Job Title: Observability Engineer (Contractor). Strong observability knowledge and experience with the Observe tool are key here.
Focus: Full-Stack Observability, System Traceability, & Executive Health Scoring
Role Summary
We are seeking a hands-on Observability Specialist to accelerate the adoption of our Observe-based platform. The ideal candidate has an SRE mindset: the ability to explore how complex systems interact and to identify the exact data sets needed to provide a 360-degree view of the environment. You will bridge the gap between disparate Lines of Business (LOBs) to build end-to-end (E2E) traceability and unified "Health Indices" that reduce mean-time-to-detect (MTTD) from hours to minutes.
Technical Skill Requirements
1. Core Observability & Tooling
Platform Expertise: Deep experience with modern observability platforms. While we use Observe, proficiency in New Relic, Splunk, or Databricks is required for rapid ramp-up.
Query & Data Fluency: Expert-level ability to write complex queries (SQL-based or in proprietary languages such as NRQL/SPL) to aggregate API success rates, latency, and crash-free session data.
Dashboard Architecture: Proven track record of building "Drill-Down" architectures that move from high-level user journeys (e.g., Login) directly into microservice-level logs and traces.
2. The Modern Tech Stack
Infrastructure: Hands-on experience with AWS (ECS/Fargate/Lambda) and Docker.
Languages: Ability to navigate and instrument code in Python or Java. This is not a hands-on coding role; what matters is the ability to read and re-engineer code and to judge whether instrumentation is correct.
Integrations: Familiarity with GraphQL for data fetching and Jenkins for CI/CD pipeline monitoring.
Instrumentation: Hands-on experience with OpenTelemetry (OTel) and familiarity with New Relic APM or Datadog APM.
3. SRE & Systems Architecture Mindset
Cross-Domain Traceability: Experience monitoring digital customer engagement across disparate system boundaries (e.g., Comms, Phone, and Backend APIs) to expose "silent failures."
Telemetry Mapping: Ability to map technical metrics to business outcomes, specifically creating Unified Health Indices for Senior Leadership (SLT).
Root Cause Analysis (RCA): Skill in configuring alerts and correlations that enable instant pinpointing of failures within complex user flows.
Equal Opportunity Employer/Veterans/Disabled
Benefit offerings available for our associates include medical, dental, vision, life insurance, short-term disability, additional voluntary benefits, an EAP program, commuter benefits, and a 401K plan. Our benefit offerings provide employees the flexibility to choose the type of coverage that meets their individual needs. In addition, our associates may be eligible for paid leave including Paid Sick Leave or any other paid leave required by Federal, State, or local law, as well as Holiday pay where applicable. Disclaimer: These benefit offerings do not apply to client-recruited jobs and jobs that are direct hires to a client.
To read our Candidate Privacy Information Statement, which explains how we will use your information, please visit .
The Company will consider qualified applicants with arrest and conviction records in accordance with federal, state, and local laws and/or security clearance requirements, including, as applicable:
The California Fair Chance Act
Los Angeles City Fair Chance Ordinance
Los Angeles County Fair Chance Ordinance for Employers
San Francisco Fair Chance Ordinance