13.04.21
Ray is a framework that makes distributed computing using Python easy to set up and run.
Often, when handling large datasets or models, it is useful to run processes on multiple computers. This can speed up training models by parallelising things like hyper-parameter search or batch gradient descent. There are a number of ways of doing this, but they all tend to be fairly difficult to set up and require significant changes to single threaded code.
Ray makes this easier. By adding a couple of lines of code it it possible to run code on a cluster of machines running in the cloud.
Why this matters: distributed computing can be done quickly and easily without changing existing code (much).