Designing, developing, testing, and operating critical software, services, and tools the ACDC team is responsible for
Designing and implementing enhancements to ACDC observability infrastructure in order to identify and correct problems before they impact our customers.
Developing subject matter expertise in ACDC components and mentoring team members
Identifying and implementing automation best practices for existing products and processes
Collaborating with our partner engineering teams to investigate and troubleshoot complex problems
Participating in on-call rotations, guiding restoration and repair of service-impacting issues
Requirements
Possess advanced level experience with designing, developing, and deploying software and infrastructure at scale
Demonstrate advanced experience in a site reliability or software engineering role, working with large-scale distributed systems.
Utilize tools such as SaltStack, Terraform, Ansible, Chef, or Puppet to manage infrastructure as code effectively and efficiently.
Have relevant experience and a Bachelor's degree in Computer Engineering, Computer Science or equivalent