3Core Systems, Inc. is seeking a skilled Jaeger Analytics / Development Engineer to design, implement, and optimize distributed tracing and observability solutions using Jaeger. The ideal candidate will work closely with DevOps, SRE, platform engineering, and application teams to enhance system visibility and performance monitoring across distributed microservices environments.
Responsibilities:
- Design, deploy, and maintain distributed tracing infrastructure using Jaeger
- Integrate tracing instrumentation into microservices, APIs, and backend systems
- Develop custom analytics dashboards and telemetry reporting solutions
- Collaborate with engineering teams to improve application observability and performance
- Analyze trace data to identify bottlenecks, latency issues, and service dependencies
- Configure and optimize telemetry pipelines using tools such as OpenTelemetry, Prometheus, and Grafana
- Build automation scripts and CI/CD integrations for monitoring deployments
- Implement alerting, monitoring standards, and observability best practices
- Support incident troubleshooting and root-cause investigations
- Document architecture, configurations, and operational procedures
Requirements:
- 12-15 years of work experience
- Skilled in designing, implementing, and optimizing distributed tracing and observability solutions using Jaeger
- Experience working closely with DevOps, SRE, platform engineering, and application teams
- Ability to improve system visibility, performance monitoring, root-cause analysis, and telemetry analytics across distributed microservices environments
- Experience in software development, observability engineering, telemetry analytics, and platform integration
- Ability to design, deploy, and maintain distributed tracing infrastructure using Jaeger
- Experience integrating tracing instrumentation into microservices, APIs, and backend systems
- Ability to develop custom analytics dashboards and telemetry reporting solutions
- Experience collaborating with engineering teams to improve application observability and performance
- Ability to analyze trace data to identify bottlenecks, latency issues, and service dependencies
- Experience configuring and optimizing telemetry pipelines using tools such as OpenTelemetry, Prometheus, and Grafana
- Ability to build automation scripts and CI/CD integrations for monitoring deployments
- Experience implementing alerting, monitoring standards, and observability best practices
- Ability to support incident troubleshooting and root-cause investigations
- Experience documenting architecture, configurations, and operational procedures