Title: Enterprise Monitoring Analyst
Position type: Contract
Hybrid: Yes
Location: NYC, NY
Should have had a good experience with any of these monitoring tools like Data Dog/Splunk/ELK/Dynatrace, etc
Communication is the key
The Enterprise Monitoring Analyst is responsible for delivering, supporting, and continuously improving large-scale monitoring and observability solutions across cloud infrastructure, networks, systems, applications, integrations, and business services. This role plays a critical part in ensuring platform reliability, proactive issue detection, and operational excellence by leveraging modern observability platforms and customer-built dashboards.
Role
Key Responsibilities
- Deliver and support enterprise-scale monitoring solutions covering cloud infrastructure, networks, systems, applications, integrations, and business activity.
- Monitor environments using observability platforms and custom dashboards to proactively identify issues and performance anomalies.
- Perform initial triage, analysis, and reproduction of issues, and route incidents to appropriate teams in accordance with established runbooks and escalation procedures.
- Escalate complex issues and collaborate with senior engineers and SMEs for advanced troubleshooting and resolution.
- Support operational performance of monitoring systems, including:
- Deployment of new monitoring products and services
- Management and reduction of false positives and false negatives
- Configuration, release, and change management activities
- Implement and support core applications and services used for monitoring and observability.
- Stay current with emerging monitoring technologies, tools, and industry trends to continuously enhance monitoring capabilities.
- Contribute to service support and service delivery goals, ensuring monitoring platforms meet availability, performance, and reliability expectations.
Required Skills & Experience
- 2+ years of hands-on experience developing application dashboards using Dynatrace.
- 2+ years of experience administering multiple monitoring platforms, such as Dynatrace, Nimsoft, Compuware, or equivalent tools.
- Experience with enterprise monitoring tools including DataDog, Splunk, Dynatrace, ELK Stack, AWS CloudWatch, and PagerDuty.
- Strong understanding of large-scale application architectures, cloud platforms, network architectures, and fault management.
- Solid hands-on experience supporting Windows, Unix, and Linux systems.
- Proven ability to analyze issues, follow runbooks, and execute structured triage and escalation processes.
- Excellent written and verbal communication skills, with the ability to collaborate across technical and non-technical teams.
- Self-driven individual with strong organizational and project management skills.
Preferred Qualifications
- Experience supporting 24x7 production environments in enterprise or SaaS platforms.
- Exposure to cloud-native monitoring and observability in AWS or similar cloud ecosystems.
- Familiarity with ITIL-aligned incident, problem, and change management processes.
- Experience working in a multi-tool, hybrid monitoring environment.
Success Indicators
- Reduced MTTR through proactive monitoring and effective triage
- Improved signal-to-noise ratio by minimizing false alerts
- High reliability and adoption of monitoring dashboards
- Effective collaboration with L2/L3 engineering teams