AlphaSense is a company that provides AI-driven market intelligence to help professionals make informed decisions. They are seeking a Staff Software Engineer for their Core Cloud Platform team, responsible for managing cloud infrastructure and leading projects to ensure scalability and performance. The role involves collaboration with various engineering teams and the implementation of AI-friendly infrastructure solutions.
Responsibilities:
- Own, operate and evolve our Cloud- and Kubernetes-based Platform
- Continuously improve the security, reliability, cost efficiency and performance of our infrastructure
- Champion standards and drive initiatives to promote uniformity and eliminate duplication
- Collaborate with Product teams to enable and empower them to build and own their services
- Work closely with other Platform teams (e.g. SRE, DevEx) to create a coherent and quality Platform
- Enable truly self-service and agentic workflow patterns that support a Build it - Own it model
- Build tooling and AI-powered integrations that reduce the cognitive load of dealing with complex infrastructure operations and systems
- Provide support for the systems we build to a globally distributed Engineering organisation
- Execute, automate and support operational tasks on complex and mission-critical systems
- Architect, design and implement simple and maintainable solutions
Requirements:
- Extensive experience with Kubernetes and its components, containerisation and AWS cloud
- Proven track record of running Kubernetes- and Cloud-based production SaaS systems at scale
- Deep understanding of networking and production-level use of service meshes and gateways
- Strong operational and troubleshooting skills with Kubernetes, Databases, Messaging Systems and Storage layers
- Proficiency in Python and Golang, preferably demonstrated through building infrastructure tooling, Kubernetes operators or control plane components
- Advocate of high-performance CI/CD practices that support agentic workflows
- Ability and desire to design, lead, influence and deliver on ambitious technical roadmaps
- Experience managing large fleets of Kubernetes clusters with lifecycle management tools (e.g. CAPI, Rancher)
- Production usage of HashiCorp Vault or similar secret management systems
- GCP and Azure knowledge