Ensure production systems in use by the Security Teams operate smoothly, within uptime objectives, and updated with the latest content and functionality
Monitor system performance and capacity
Proactively recommend and implement changes while automating ‘toil’ and repetitive tasks
Architect new and existing systems to enhance performance, reliability, and scalability
Build, implement, iterate over CI/CD pipelines
Assist with the Management, Development, Design, and Deployment of microservice and containerized applications
Implement strong security controls in distributed systems/agents
Coordinate with engineers and developers to automate deployments and configurations across various platforms
Abstract the complexity of Observability implementation by writing scalable automation
Identify opportunities for improvement around observability and process
Standardization and development of alerts/notifications and response to monitoring tools
Work alongside application teams to implement Observability in day-to-day operations
Contribute to post-mortems and provide root cause analysis and implementation of resulting action items
Promote DevOps best-practices within the team
Participate and promote Agile/Scrum
Contribute to hybrid cloud production containerization service offering
Design and implement standards, policies, and procedures for automation and integrations
Requirements
Bachelor’s Degree with 7 years’ experience; Master’s Degree with 6 years’ experience; PhD with 2 years’ experience
Treat best practices for security as a requirement, not an afterthought
Knowledge of Cloud Platform administration (AWS, GCP, Azure)
Familiarity with Observability pillars
Experience in working in high-scale environments and understanding of distributed architectures
Knowledge of Agile / DevOps methodologies
Experience with CI/CD tools (Github Actions, Bamboo, Jenkins, Azure DevOps)
Familiarity with running docker workloads using orchestration tools (Kubernetes / Amazon ECS)
Ability to work both independently without direction and within a group for day-to-day activities
Passion for learning new concepts and processes quickly, and adapting to a changing environment
Comfortable working in and administering Linux and Windows environments
Preferred: Exposure and implementation of SPIRE/SPIFFE
Direct experience with Terraform/Crossplane
Proficiency working with development tools and scripting languages (git / mercurial / subversion; Python / Elixir / Go)
Integrating MCP Servers with authorization controls
Knowledge of database management systems (NoSQL, Relational Databases, and associated query languages)