Loadsmart is a growth-stage technology company valued at over $1 billion, focused on reinventing the future of freight through innovative technology. The Senior Site Reliability Engineer will design infrastructure and platform architecture, automate CI/CD pipelines, and troubleshoot complex security and networking issues.
Responsibilities:
- Design infrastructure, networking, and software platform architecture
- Define platform guidelines, requirements, and processes aligned with DevOps practices
- Build and maintain infrastructure automation using Infrastructure as Code (IaC)
- Ensure auditable delivery of infrastructure definitions and changes
- Automate Continuous Integration and Continuous Deployment (CI/CD) pipelines
- Drive Developer Experience and productivity initiatives, including service catalogs and service maturity
- Build and maintain the shared application platform used by all engineering teams
- Operate and manage multiple Kubernetes clusters
- Design, develop, and maintain core systems using common programming languages
- Build and maintain internal tooling used across engineering teams
- Troubleshoot infrastructure, internal applications, networking, and security issues
- Build and maintain observability platforms, guidelines, and standards
- Define and manage internal platform SLIs, SLOs, and SLAs
- Manage backup policies and operations
- Maintain database fleets, including upgrades, security patches, performance tuning, and troubleshooting
- Conduct security risk assessments, vulnerability scans, and security testing, and configure VPNs
- Work with tools and technologies including Linux, Python, Go, JavaScript, and shell scripting
Requirements:
- Bachelor's degree or foreign equivalent in Computer Science, Computer Engineering, or Information Technology
- 2+ years of experience in the role offered or 2+ years of experience as a Reliability Engineer, Cloud Engineer, Software Engineer, or in a related occupation