Cloud Infrastructure Management: Design, deploy, and maintain scalable and resilient infrastructure on AWS using Infrastructure-as-Code (IaC).
Kubernetes Administration: Manage and optimize Kubernetes clusters for containerized applications, ensuring high availability and security.
Automation & CI/CD: Implement and manage CI/CD pipelines for efficient deployment, testing, and monitoring of applications.
Observability & Monitoring: Develop comprehensive monitoring solutions using Prometheus, Grafana, LGTM stack, or similar tools to improve system reliability.
Security & Compliance: Apply best practices for cloud security, IAM policies, and compliance frameworks (SOC2, ISO 27001, etc.).
Incident Response & Performance Optimization: Troubleshoot issues, perform root cause analysis, and implement fixes to optimize performance.
Infrastructure as Code (IaC): Utilize Terraform, Ansible, or similar tools to automate infrastructure provisioning and configuration management.
Collaboration & Knowledge Sharing: Work closely with software engineering, architecture and security teams to promote DevOps culture and best practices.
Disaster Recovery & Reliability Engineering: Design failover and backup strategies to ensure business continuity in the event of failures.

Bachelor’s degree in Computer Science, Engineering, or a related field.
5+ years of experience in cloud infrastructure, SRE, or DevOps roles.
Interest in or any exposure to trading or similar themes would be desirable (not essential)
AWS Certified SysOps Administrator
Associate: desired.
Strong expertise in AWS (EC2, S3, Lambda, RDS, VPC, IAM, etc.).
Hands-on experience with Kubernetes (EKS, K3s, or self-managed clusters).
Proficiency in scripting and automation using Python, Bash, or similar.
Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible).
Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, etc.).
Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls).
Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices.
Experience with high-performance and low-latency (sub millisecond) systems.
Familiarity with serverless architectures and event-driven computing.
Familiarity with Rust compilation processes and techniques.
Willing to collaborate and communicate asynchronously.
Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation
Team spirit, ownership, critical thinking
Exposure to cloud cost optimization and FinOps strategies.
Previous exposure working with Crypto, Traditional Finance (Trad Fi) or Trading would be highly desirable but not essential

A competitive salary package, with various benefits depending on method of engagement (Employee vs Contractor)
Autonomy in your time management thanks to flexible working hours and the opportunity to work remotely
The freedom to create your own entrepreneurial experience by being part of a team of people in search of excellence
Continuing Professional Development plan with learning and certification path in accordance with both the team objectives and areas of interests

Site Reliability Engineer

Key skills