Anyscale is on a mission to democratize distributed computing and make it accessible to software developers of all skill levels. They are seeking a Software Engineer to improve the performance of Ray Data, focusing on building data loading solutions and ensuring stability and fault tolerance in AI workloads.
Responsibilities:
- Improve the performance of Ray Data and multi-modal batch inference use cases
- Ensure efficient scaling across different stages of the data pipeline in a heterogeneous environment
- Build data loading solutions for production training workloads
- Focus on stability and fault tolerance at high scale
- Work with customers and new-age, AI-native companies to scale their AI workloads
Requirements:
- At least 3-4 years of relevant work experience
- Solid background in building scalable and fault-tolerant distributed systems
- Experience with data processing and database internals
- Passion for large-scale systems and performance for AI