Ray: Easier distributed computing

13.04.21

Ray: Easier distributed computing

Ray is a framework that makes distributed computing using Python easy to set up and run.

Often, when handling large datasets or models, it is useful to run processes on multiple computers. This can speed up training models by parallelising things like hyper-parameter search or batch gradient descent. There are a number of ways of doing this, but they all tend to be fairly difficult to set up and require significant changes to single threaded code.

Ray makes this easier. By adding a couple of lines of code it it possible to run code on a cluster of machines running in the cloud.

Why this matters: distributed computing can be done quickly and easily without changing existing code (much).

←Previous: Billion dollar computer vision startups

Next: Andrew Ng says: "Sort out your data!"→

Ray: Easier distributed computing

Keep up with the latest developments in data science. One email per month.