
Observability Engineer
REMOTE
PST working hours
Design, deploy, and optimize enterprise observability platforms (e.g., Dynatrace, Tanium) to enable full-stack visibility across infrastructure, applications, and cloud environments
Develop and maintain advanced dashboards, alerting thresholds, and anomaly detection models to support proactive incident identification and AIOps initiatives
Lead root cause analysis for complex, cross-domain incidents by correlating telemetry data (metrics, logs, traces) across distributed systems
Define and enforce observability standards, instrumentation frameworks, and data retention policies across engineering teams
Partner with platform and application teams to embed observability practices into CI/CD pipelines