Implement technical solutions in Agile iterations, including design, development, testing, deployment, and release, with a focus on scalability, reliability, and security
Design and build observability solutions for cloud-based systems, including metrics, logs, traces, and synthetic monitoring
Develop and enhance integrations with Dynatrace and Catchpoint, including dashboards, alerting, and monitoring configurations
Develop and maintain OpenTelemetry instrumentation and telemetry pipelines to ensure consistent observability across services
Implement infrastructure as code using Terraform or similar tools to provision and manage observability resources
Develop, debug, and build automated tests for backend services and platform components
Monitor and support large-scale production systems, including debugging, performance analysis, and incident response through on-call rotations
Collaborate with software engineers, SREs, and platform teams to improve system visibility, reliability, and performance
Participate in code reviews and contribute to engineering best practices and team improvements
Communicate technical solutions and project status with stakeholders
Efficiently leverage AI-assisted development tools such as Cursor and Claude to improve productivity and automate workflows
Requirements
Bachelor’s degree in Computer Science, Computer Engineering, or related field, or equivalent experience
5+ years of experience in software engineering building cloud-native applications or platforms
Strong programming skills in Java, Go, or Python
Experience building and supporting distributed systems and microservices
Hands-on experience with observability tools such as Dynatrace, Catchpoint, or similar
Experience with OpenTelemetry or distributed tracing concepts
Experience with AWS and Kubernetes-based environments
Experience with infrastructure as code such as Terraform or CloudFormation
Strong understanding of system design, scalability, and reliability
Strong problem-solving skills with the ability to debug complex production issues
Good communication skills and ability to work collaboratively across teams.