Responsible for building and operating the infrastructure that enables BJAK’s engineering teams to ship reliably, securely, and at scale
Work closely with developers to design deployment workflows, maintain production systems, and continuously improve reliability, performance, and security across our cloud environments
Design, build, and maintain CI/CD pipelines to support automated build, test, and deployment workflows
Manage, operate, and scale Kubernetes clusters (EKS, GKE, AKS, or self-managed)
Provision and maintain cloud infrastructure using Infrastructure as Code (Terraform, Pulumi, CloudFormation)
Improve system reliability, performance, availability, and scalability
Implement and maintain monitoring, logging, and alerting systems (e.g. Prometheus, Grafana, ELK, Datadog)
Support zero-downtime deployments, rollbacks, and release strategies
Troubleshoot production issues including latency, errors (e.g. 502s), pod crashes, OOMs, and networking problems
Implement and enforce security best practices, including secrets management, IAM, and network policies
Collaborate closely with software engineers on deployment strategies, system design, and operational readiness
Requirements
Bachelor’s degree in Computer Science or equivalent practical experience
Strong experience with Linux environments and shell scripting
Hands-on experience with Docker and Kubernetes
Experience working with at least one major cloud platform (AWS, GCP, or Azure)
Solid understanding of CI/CD systems (GitHub Actions, GitLab CI, Jenkins, etc.)
Experience implementing Infrastructure as Code
Good understanding of networking fundamentals (DNS, TCP/IP, load balancing)
Comfortable debugging and resolving production issues
Strong understanding of performance, scalability, and reliability principles
Experience with backend or web application environments
Proficiency in at least one scripting language such as Bash or Python
Ability to work independently, take ownership, and drive initiatives end-to-end
Tech Stack
AWS
Azure
Cloud
DNS
Docker
Google Cloud Platform
Grafana
Jenkins
Kubernetes
Linux
Prometheus
Python
Shell Scripting
TCP/IP
Terraform
Benefits
Work on production systems that operate at real scale across multiple markets
High ownership role with meaningful impact on reliability, security, and developer velocity
Collaborate closely with experienced engineers in a fast-moving environment
Flat structure where execution and results matter more than titles
Opportunity to build, improve, and own critical infrastructure end-to-end
Competitive compensation and long-term growth opportunities