Loft Orbital is revolutionizing access to space by building reliable, shareable satellites that drastically reduce the time and complexity traditionally required to get to orbit. As a Senior Site Reliability Engineer on our Cloud Infrastructure Team, you’ll play a pivotal role in maintaining and scaling our ground segment infrastructure.
Responsibilities:
- Collaborate with developers and satellite operators to foster a strong SatDevOps culture
- Design, implement, and maintain scalable, reliable, and secure infrastructure in a hybrid cloud environment
- Improve our developer experience by building better tools, workflows, and environments
- Lead efforts to automate and optimize systems, including CI/CD pipelines, infrastructure provisioning (IaC), and deployment workflows
- Own and evolve our observability stack (metrics, tracing, logs) to improve usability and performance. Grafana-centric ecosystems are a plus
- Implement and advocate for best practices in software reliability, fault tolerance, and performance tuning
- Proactively identify, investigate, and resolve system reliability issues, performing root cause analyses and implementing long-term fixes
- Partner with teams to design and operate Software Defined Network (SDN) solutions
- Contribute to a collaborative and inclusive team culture where respectful debate and continuous learning are celebrated
Requirements:
- Strong experience with public cloud infrastructure, ideally GCP
- Deep expertise in Kubernetes, architecture, deployment, ops, and resource optimization
- Demonstrated ability to design and build scalable, highly available systems
- Familiarity with Software Defined Networking (SDN) concepts and tools
- Experience implementing and maintaining observability stacks (Grafana, Prometheus, Loki, etc.)
- Proficiency in at least one backend language: Go, Python, Rust, C/C++, or Java
- Deep understanding and hands-on experience with DevOps practices: CI/CD, infrastructure as code (IaC), and automation
- Proven track record of working in fast-paced, high-growth technical environments
- Excellent problem-solving skills and ability to operate independently with a proactive, results-driven mindset
- Strong communication skills; thrives in a multicultural, cross-functional team
- Master's degree in Computer Science or a similar field
- Hands-on experience with GitOps frameworks (ArgoCD, FluxCD)
- Interest or experience in FinOps and cost-optimized architectures
- Understanding of orchestration in resource-constrained environments, like space systems
- Knowledge of systems engineering tools and SDLC governance
- Familiarity with security practices, vulnerability scanning, threat detection, risk mitigation