In this tutorial I'll show you building a movie recommendation service with Apache Spark. Two users are alike if they rated a product similarly. For example, if Alice rated a book 3/5 and Bob also rated the same book 3.3/5 they are very much alike. Now if Bob buys another book and rates it 4/5 we should suggest that book to Alice, that's what a recommender system does. See references if you want to know more about how recommender systems work.
We are going to use Alternating Least Squares method from MLLib, and MovieLens 100K dataset which is only 5 MB in size. Download the dataset from [hide_from_apps container="span"]https://grouplens.org/datasets/movielens/.[/hide_from_apps][show_on