Voleon is a technology company that applies state-of-the-art AI and machine learning techniques to real-world problems in finance. As a Site Reliability Engineer (SRE), you will work at the intersection of production operations and software development to improve, manage, and monitor production-critical infrastructure and data pipelines.
Responsibilities:
- Improve fault-tolerance and maintainability of code in proprietary data pipelines and trading systems
- Diagnose and fix bugs in code
- Lead complex deployments
- Automate manual workflows
- Track and prioritize outstanding production-related issues
- Share an on-call rotation responding to incidents to ensure the continuous operation of production-critical systems
Requirements:
- Experience with coding and debugging Python
- Experience with Linux
- Familiarity with Relational Databases & SQL
- Sharp analytical and problem-solving skills
- Strong growth mindset and a passion for learning
- Strong technical communication skills
- Attention to detail
- 2 years of relevant industry experience
- An undergraduate degree in a quantitative field or equivalent experience
- Familiarity with best practices concerning code maintainability, documentation, quality assurance, continuous integration and deployment
- Experience supporting production systems
- Experience with any of the following: gRPC microservices, Postgres, Pandas, Golang, R, Git, Jenkins, Bazel, Prometheus, Grafana, Airflow, Kubernetes