Architect and optimize observability environments through software engineering best practices
Build automation scripts and tools to streamline observability instrumentation and deployment
Integrate observability into CI/CD pipelines using infrastructure-as-code (IaC) and configuration management tools
Conduct observability code reviews and implement automated validation checks
Build AI-native tools for autonomous issue detection and resolution
Lead the design of distributed systems focused on reliability, scalability, and performance
Manage data routing, transformation, and endpoint monitoring integrations
Mentor junior engineers and foster a culture of continuous improvement
Collaborate across teams to align observability strategies with business goals
Communicate technical insights and recommendations to leadership
Requirements
Associate degree in Computer Science, Engineering, or related field
7+ years in software/infrastructure engineering with observability/logging focus
Strong foundation in operating systems, networking, algorithms, and distributed systems
Expertise in:
o Programming (eg. Python, .Net, Java, Go, Node.js)
o System architecture and AI Ops
o CI/CD (eg. GitHub Actions, Azure DevOps)
o Infrastructure as Code (eg. Terraform, Ansible)
o Cloud-native development (eg. Azure, AWS, GCP)
o API design (eg. REST, gRPC)
Proven leadership and mentoring experience
Agile delivery experience and strategic thinking around observability maturity
Tech Stack
Ansible
AWS
Azure
Cloud
Distributed Systems
Google Cloud Platform
GRPC
Java
JavaScript
Node.js
Python
Terraform
Go
Benefits
medical, dental and vision benefits
401(k) retirement savings plan
time off (including paid time off, company and personal holidays, volunteer time off, paid parental and caregiver leave)