8 months ago
Responsibilities
- Develop high-quality open-source software to simplify distributed programming with Ray.
- Identify and implement architectural improvements to Ray core and Datasets.
- Improve the testing process for Ray to ensure smooth releases.
- Communicate your work through talks, tutorials, and blog posts.
- Work on performance optimization of Ray Datasets at large scale.
- Integrate ML training and data sources with Ray.
- Lead efforts to integrate streaming workloads into Ray.
Requirements
- At least 5 years of relevant work experience.
- Solid background in algorithms, data structures, and system design.
- Experience in building scalable and fault-tolerant distributed systems.
- Familiarity with data processing and database internals, including Spark or Dask.
- Experience with streaming data processing is a plus.
