Independent Infrastructure Ownership: Propose and implement architectural approaches for cloud infrastructure components across Azure and GCP, working independently with limited supervision.
Infrastructure as Code (IaC): Own the lifecycle of environments using Terraform and manage complex Kubernetes application deployments via Helm.
Security & Compliance: Implement cloud security best practices to protect sensitive data, ensuring adherence to strict government and enterprise security mandates.
Reliability & Performance: Build resilience and reliability mechanisms (observability, auto-scaling, self-healing) to improve system performance.
Technical Coaching: Act as a trusted source for code and infrastructure reviews; coach junior engineers and review their work to uphold high quality standards.
Collaborative Triage: Support and triage complex production issues, coordinating with software teams to optimize Java-based microservices and resolve performance bottlenecks.
Operational Excellence: Work to achieve operational targets with a significant impact on departmental results, including uptime, deployment velocity, and cost-efficiency.
Requirements
Bachelor’s degree with 4+ years of DevOps, SRE, or systems engineering experience and 2+ years of Java experience.
Must be a U.S. citizen with the ability to obtain the necessary security clearance, as required by government contract.